Search CORE

107 research outputs found

Geo-Information Harvesting from Social Media Data

Author: Abdulahhad Karam
Hoffmann Eike Jens
Häberle Matthias
Jacobs Nathan
Kochupillai Mrinalini
Kruspe Anna
Levering Alex
Taubenböck Hannes
Tuia Devis
Wang Yuanyuan
Werner Martin
Zhu Xiao Xiang
Publication venue
Publication date: 01/01/2022
Field of study

As unconventional sources of geo-information, massive imagery and text messages from open platforms and social media form a temporally quasi-seamless, spatially multi-perspective stream, but with unknown and diverse quality. Due to its complementarity to remote sensing data, geo-information from these sources offers promising perspectives, but harvesting is not trivial due to its data characteristics. In this article, we address key aspects in the field, including data availability, analysis-ready data preparation and data management, geo-information extraction from social media text messages and images, and the fusion of social media and remote sensing data. We then showcase some exemplary geographic applications. In addition, we present the first extensive discussion of ethical considerations of social media data in the context of geo-information harvesting and geographic applications. With this effort, we wish to stimulate curiosity and lay the groundwork for researchers who intend to explore social media data for geo-applications. We encourage the community to join forces by sharing their code and data.Comment: Accepted for publication IEEE Geoscience and Remote Sensing Magazin

arXiv.org e-Print Archive

Institute of Transport Research:Publications

Social media analytics – Challenges in topic discovery, data collection, and data preparation

Author: Abbasi
Al-Qurishi
Alsubaiee
Alsudais
Anderson
Aral
Artikis
Baars
Beier
Bem
Bendler
Bhattacharya
Bi
Bindra
Björn Ross
Blegind
Blei
boyd
Bruns
Cao
Cao
Carr
Chae
Chang
Chen
Chen
Chen
Chen
Chinnov
Christoph Neuberger
Cossu
Demchenko
Diaz
Driscoll
Fan
Garcia
Griffiths
Guellil
Hargittai
Hernandez
Hofmann
Howe
Hu
Huang
Huang
Immonen
Jalonen
Japec
Ji
Johannessen
Jungherr
Jungherr
Kalhour
Kane
Kaplan
Kim
King
Kitchin
Lin
Liu
Liu
Liu
Lukoianova
Mahrt
McAfee
Milad Mirbabaie
Mirbabaie
Morstatter
Nulty
Payton
Pletikosa Cvijikj
Pletikosa Cvijikj
Qian
Qin
Rehman
Ruths
Schober
Shah
Stefan Stieglitz
Stieglitz
Stieglitz
Stieglitz
Stieglitz
Stieglitz
Susarla
Vavliakis
Venkatesh
Webster
Weiler
Yang
Zeng
Publication venue: 'Elsevier BV'
Publication date: 30/04/2018
Field of study

Crossref

Edinburgh Research Explorer

From Sensor to Observation Web with Environmental Enablers in the Future Internet

Author: Berre Arene J.
Havlik Denis
Lorenzo Mon Jose
Mazzetti Paolo
Sabeur Zoheir
Schade Sven
Watson Kym
Publication venue
Publication date: 01/01/2011
Field of study

This paper outlines the grand challenges in global sustainability research and the objectives of the FP7 Future Internet PPP program within the Digital Agenda for Europe. Large user communities are generating significant amounts of valuable environmental observations at local and regional scales using the devices and services of the Future Internet. These communities’ environmental observations represent a wealth of information which is currently hardly used or used only in isolation and therefore in need of integration with other information sources. Indeed, this very integration will lead to a paradigm shift from a mere Sensor Web to an Observation Web with semantically enriched content emanating from sensors, environmental simulations and citizens. The paper also describes the research challenges to realize the Observation Web and the associated environmental enablers for the Future Internet. Such an environmental enabler could for instance be an electronic sensing device, a web-service application, or even a social networking group affording or facilitating the capability of the Future Internet applications to consume, produce, and use environmental observations in cross-domain applications. The term ?envirofied? Future Internet is coined to describe this overall target that forms a cornerstone of work in the Environmental Usage Area within the Future Internet PPP program. Relevant trends described in the paper are the usage of ubiquitous sensors (anywhere), the provision and generation of information by citizens, and the convergence of real and virtual realities to convey understanding of environmental observations. The paper addresses the technical challenges in the Environmental Usage Area and the need for designing multi-style service oriented architecture. Key topics are the mapping of requirements to capabilities, providing scalability and robustness with implementing context aware information retrieval. Another essential research topic is handling data fusion and model based computation, and the related propagation of information uncertainty. Approaches to security, standardization and harmonization, all essential for sustainable solutions, are summarized from the perspective of the Environmental Usage Area. The paper concludes with an overview of emerging, high impact applications in the environmental areas concerning land ecosystems (biodiversity), air quality (atmospheric conditions) and water ecosystems (marine asset management)

Southampton (e-Prints Soton)

Directory of Open Access Journals

Fraunhofer-ePrints

PubMed Central

Bournemouth University Research Online

Geo-Information Harvesting from Social Media Data

Author: Abdulahhad Karam
Hoffmann Eike Jens
Häberle Matthias
Jacobs Nathan
Kochupillai Mrinalini
Kruspe Anna
Levering Alex
Taubenböck Hannes
Tuia Devis
Wang Yuanyuan
Werner Martin
Zhu Xiao Xiang
Publication venue: IEEE - Institute of Electrical and Electronics Engineers
Publication date: 01/01/2023
Field of study

As unconventional sources of geo-information, massive imagery and text messages from open platforms and social media form a temporally quasi-seamless, spatially multiperspective stream, but with unknown and diverse quality. Due to its complementarity to remote sensing data, geo-information from these sources offers promising perspectives, but harvesting is not trivial due to its data characteristics. In this article, we address key aspects in the field, including data availability, analysisready data preparation and data management, geo-information extraction from social media text messages and images, and the fusion of social media and remote sensing data. We then showcase some exemplary geographic applications. In addition, we present the first extensive discussion of ethical considerations of social media data in the context of geo-information harvesting and geographic applications. With this effort, we wish to stimulate curiosity and lay the groundwork for researchers who intend to explore social media data for geo-applications. We encourage the community to join forces by sharing their code and data

Institute of Transport Research:Publications

Visual analytics of location-based social networks for decision support

Author: Chae Junghoon
Publication venue: 'Purdue University (bepress)'
Publication date: 01/01/2016
Field of study

Recent advances in technology have enabled people to add location information to social networks called Location-Based Social Networks (LBSNs) where people share their communication and whereabouts not only in their daily lives, but also during abnormal situations, such as crisis events. However, since the volume of the data exceeds the boundaries of human analytical capabilities, it is almost impossible to perform a straightforward qualitative analysis of the data. The emerging field of visual analytics has been introduced to tackle such challenges by integrating the approaches from statistical data analysis and human computer interaction into highly interactive visual environments. Based on the idea of visual analytics, this research contributes the techniques of knowledge discovery in social media data for providing comprehensive situational awareness. We extract valuable hidden information from the huge volume of unstructured social media data and model the extracted information for visualizing meaningful information along with user-centered interactive interfaces. We develop visual analytics techniques and systems for spatial decision support through coupling modeling of spatiotemporal social media data, with scalable and interactive visual environments. These systems allow analysts to detect and examine abnormal events within social media data by integrating automated analytical techniques and visual methods. We provide comprehensive analysis of public behavior response in disaster events through exploring and examining the spatial and temporal distribution of LBSNs. We also propose a trajectory-based visual analytics of LBSNs for anomalous human movement analysis during crises by incorporating a novel classification technique. Finally, we introduce a visual analytics approach for forecasting the overall flow of human crowds

Purdue E-Pubs

Sequential assimilation of crowdsourced social media data into a simplified flood inundation model

Author: Songchon Chanin
Publication venue: Energy, Geoscience, Infrastructure and Society
Publication date: 01/04/2023
Field of study

Flooding is the most common natural hazard worldwide. Severe floods can cause significant damage and sometimes loss of life. During a flood event, hydraulic models play an important role in forecasting and identifying potential inundated areas, where emergency responses should be deployed. Nevertheless, hydraulic models are not able to capture all of the processes in flood propagation because flood behaviour is highly dynamic and complex. Thus, there are always uncertainties associated with model simulations. As a result, near-real time observations are required to incorporate with hydraulic models to improve model forecasting skills. Crowdsourced (CS) social media data presents an opportunity for supporting urban flood management as it can provide insightful information collected by individuals in near real-time. In this thesis, approachesto maximise the impact of CS social media data (Twitter) to reduce uncertainty in flood inundation modelling (LISFLOOD-FP) through data assimilation were investigated. The developed methodologies were tested and evaluated using a real flooding case study of Phetchaburi city, Thailand. Firstly, two approaches (binary logistic regression and fuzzy logic) were developed based on Twitter metadata and spatiotemporal analysis to assess the quality of CS social media data. Both methods produced good results, but the binary logistic model was preferred as it involved less subjectivity. Next, the generalized likelihood uncertainty estimation methodology was applied to estimate model uncertainty and identify behavioural parameter ranges. Particle swarm optimisation was also carried out to calibrate for an optimum model parameter set. Following this, an ensemble Kalman filter was applied to assimilate the flood depth information extracted from the CS data into the LISFLOOD-FP simulations using various updating strategies. The findings show that the global state update suffers from inconsistency of predicted water levels due to overestimating the impact of the CS data, whereas a topography based local state update provides encouraging results as the uncertainty in model forecasts narrows, albeit for a short time period. To extend the improvement time span, a combination of state and boundary updating was further investigated to correct both water levels and model inputs, and was found to produce longer lasting improvements in terms of uncertainty reduction. Overall, the results indicate the feasibility of applying CS social media data to reduce model uncertainty in flood forecasting

ROS: The Research Output Service. Heriot-Watt University Edinburgh

Development and Applications of Similarity Measures for Spatial-Temporal Event and Setting Sequences

Author: Xu Fuyu
Publication venue: DigitalCommons@UMaine
Publication date: 05/05/2023
Field of study

Similarity or distance measures between data objects are applied frequently in many fields or domains such as geography, environmental science, biology, economics, computer science, linguistics, logic, business analytics, and statistics, among others. One area where similarity measures are particularly important is in the analysis of spatiotemporal event sequences and associated environs or settings. This dissertation focuses on developing a framework of modeling, representation, and new similarity measure construction for sequences of spatiotemporal events and corresponding settings, which can be applied to different event data types and used in different areas of data science. The first core part of this dissertation presents a matrix-based spatiotemporal event sequence representation that unifies punctual and interval-based representation of events. This framework supports different event data types and provides support for data mining and sequence classification and clustering. The similarity measure is based on the modified Jaccard index with temporal order constraints and accommodates different event data types. This approach is demonstrated through simulated data examples and the performance of the similarity measures is evaluated with a k-nearest neighbor algorithm (k-NN) classification test on synthetic datasets. These similarity measures are incorporated into a clustering method and successfully demonstrate the usefulness in a case study analysis of event sequences extracted from space time series of a water quality monitoring system. This dissertation further proposes a new similarity measure for event setting sequences, which involve the space and time in which events occur. While similarity measures for spatiotemporal event sequences have been studied, the settings and setting sequences have not yet been considered. While modeling event setting sequences, spatial and temporal scales are considered to define the bounds of the setting and incorporate dynamic variables along with static variables. Using a matrix-based representation and an extended Jaccard index, new similarity measures are developed to allow for the use of all variable data types. With these similarity measures coupled with other multivariate statistical analysis approaches, results from a case study involving setting sequences and pollution event sequences associated with the same monitoring stations, support the hypothesis that more similar spatial-temporal settings or setting sequences may generate more similar events or event sequences. To test the scalability of STES similarity measure in a larger dataset and an extended application in different fields, this dissertation compares and contrasts the prospective space-time scan statistic with the STES similarity approach for identifying COVID-19 hotspots. The COVID-19 pandemic has highlighted the importance of detecting hotspots or clusters of COVID-19 to provide decision makers at various levels with better information for managing distribution of human and technical resources as the outbreak in the USA continues to grow. The prospective space-time scan statistic has been used to help identify emerging disease clusters yet results from this approach can encounter strategic limitations imposed by the spatial constraints of the scanning window. The STES-based approach adapted for this pandemic context computes the similarity of evolving normalized COVID-19 daily cases by county and clusters these to identify counties with similarly evolving COVID-19 case histories. This dissertation analyzes the spread of COVID-19 within the continental US through four periods beginning from late January 2020 using the COVID-19 datasets maintained by John Hopkins University, Center for Systems Science and Engineering (CSSE). Results of the two approaches can complement with each other and taken together can aid in tracking the progression of the pandemic. Overall, the dissertation highlights the importance of developing similarity measures for analyzing spatiotemporal event sequences and associated settings, which can be applied to different event data types and used for data mining, sequence classification, and clustering

University of Maine

A Pattern Approach to Examine the Design Space of Spatiotemporal Visualization

Author: Guo Chen
Publication venue: 'Purdue University (bepress)'
Publication date: 01/01/2017
Field of study

Pattern language has been widely used in the development of visualization systems. This dissertation applies a pattern language approach to explore the design space of spatiotemporal visualization. The study provides a framework for both designers and novices to communicate, develop, evaluate, and share spatiotemporal visualization design on an abstract level. The touchstone of the work is a pattern language consisting of fifteen design patterns and four categories. In order to validate the design patterns, the researcher created two visualization systems with this framework in mind. The first system displayed the daily routine of human beings via a polygon-based visualization. The second system showed the spatiotemporal patterns of co-occurring hashtags with a spiral map, sunburst diagram, and small multiples. The evaluation results demonstrated the effectiveness of the proposed design patterns to guide design thinking and create novel visualization practices

Purdue E-Pubs

Potential Indirect Relationships in Productive Networks

Author: Sabino André Miguel Guedelha
Publication venue
Publication date: 01/12/2016
Field of study

Productive Networks, such as Social Networks Services, organize evidence about human behavior. This evidence is independent of the network content type, and may support the discovery of new relationships between users and content, or with other users. These indirect relationships are important for recommendation systems, and systems where potential relationships between users and content (e.g., locations) is relevant, such as with the emergency management domain, where the discovery of relationships between users and locations on productive networks may enable the identification of population density variations, increasing the accuracy of emergency alerts. This thesis presents a Productive Networks model, which enables the development of a methodology for indirect relationships discovery, using the metadata on the network, and avoiding the computational cost of content analysis. We designed and conducted a set of experiments to evaluate our proposals. Our results are twofold: firstly, the productive network model is sufficiently robust to represent a wide range of networks; secondly, the indirect relationship discovery methodology successfully identifies relevant relationships between users and content. We also present applications of the model and methodology in several contexts

Repositório da Universidade Nova de Lisboa