1,230 research outputs found

    Enhancing Exploratory Analysis across Multiple Levels of Detail of Spatiotemporal Events

    Get PDF
    Crimes, forest fires, accidents, infectious diseases, human interactions with mobile devices (e.g., tweets) are being logged as spatiotemporal events. For each event, its spatial location, time and related attributes are known with high levels of detail (LoDs). The LoD of analysis plays a crucial role in the user’s perception of phenomena. From one LoD to another, some patterns can be easily perceived or different patterns may be detected, thus requiring modeling phenomena at different LoDs as there is no exclusive LoD to study them. Granular computing emerged as a paradigm of knowledge representation and processing, where granules are basic ingredients of information. These can be arranged in a hierarchical alike structure, allowing the same phenomenon to be perceived at different LoDs. This PhD Thesis introduces a formal Theory of Granularities (ToG) in order to have granules defined over any domain and reason over them. This approach is more general than the related literature because these appear as particular cases of the proposed ToG. Based on this theory we propose a granular computing approach to model spatiotemporal phenomena at multiple LoDs, and called it a granularities-based model. This approach stands out from the related literature because it models a phenomenon through statements rather than just using granules to model abstract real-world entities. Furthermore, it formalizes the concept of LoD and follows an automated approach to generalize a phenomenon from one LoD to a coarser one. Present-day practices work on a single LoD driven by the users despite the fact that the identification of the suitable LoDs is a key issue for them. This PhD Thesis presents a framework for SUmmarizIng spatioTemporal Events (SUITE) across multiple LoDs. The SUITE framework makes no assumptions about the phenomenon and the analytical task. A Visual Analytics approach implementing the SUITE framework is presented, which allow users to inspect a phenomenon across multiple LoDs, simultaneously, thus helping to understand in what LoDs the phenomenon perception is different or in what LoDs patterns emerge

    Identifying Hidden Visits from Sparse Call Detail Record Data

    Full text link
    Despite a large body of literature on trip inference using call detail record (CDR) data, a fundamental understanding of their limitations is lacking. In particular, because of the sparse nature of CDR data, users may travel to a location without being revealed in the data, which we refer to as a "hidden visit". The existence of hidden visits hinders our ability to extract reliable information about human mobility and travel behavior from CDR data. In this study, we propose a data fusion approach to obtain labeled data for statistical inference of hidden visits. In the absence of complementary data, this can be accomplished by extracting labeled observations from more granular cellular data access records, and extracting features from voice call and text messaging records. The proposed approach is demonstrated using a real-world CDR dataset of 3 million users from a large Chinese city. Logistic regression, support vector machine, random forest, and gradient boosting are used to infer whether a hidden visit exists during a displacement observed from CDR data. The test results show significant improvement over the naive no-hidden-visit rule, which is an implicit assumption adopted by most existing studies. Based on the proposed model, we estimate that over 10% of the displacements extracted from CDR data involve hidden visits. The proposed data fusion method offers a systematic statistical approach to inferring individual mobility patterns based on telecommunication records

    CAPTURE AND ANALYSIS OF SENSOR DATA FOR ASTHMA PATIENTS

    Get PDF
    Worldwide more than 230 million people suffer from asthma. Reliable and timely guidance for indi-viduals to minimize their risk for asthma attacks is not available. This is largely due to the fact that asthma symptoms are often caused by multiple environmental and personal factors. Many of them are neither captured nor systematically analysed. This is addressed by the project ActOnAir. It aims at a comprehensive capture of health factors and the environmental exposure of individuals, as well as a subsequent analysis in real-time. For this purpose the ActOnAir system provides a mobile sensor box for data collection, a sensor data integration and processing platform, a data mining component and a smartphone application for patients. This contribution outlines the design objectives of the ActOnAir system and discusses corresponding key requirements. The related system architecture is introduced and first results from a prototype implementation are sketched

    Sequential movement pattern-mining (SMP) in field-based team-sport: A framework for quantifying spatiotemporal data and improve training specificity?

    Get PDF
    Athlete external load is typically quantified as volumes or discretised threshold values using distance, speed and time. A framework accounting for the movement sequences of athletes has previously been proposed using radio frequency data. This study developed a framework to identify sequential movement sequences using GPS-derived spatiotemporal data in team-sports and establish its stability. Thirteen rugby league players during one match were analysed to demonstrate the application of the framework. The framework (Sequential Movement Pattern-mining [SMP]) applies techniques to analyse i) geospatial data (i.e., decimal degree latitude and longitude), ii) determine players turning angles, iii) improve movement descriptor assignment, thus improving movement unit formation and iv) improve the classification and identification of players’ frequent SMP. The SMP framework allows for sub-sequences of movement units to be condensed, removing repeated elements, which offers a novel technique for the quantification of similarities or dis-similarities between players and playing positions. The SMP framework provides a robust and stable method that allows, for the first time the analysis of GPS-derived data and identifies the frequent SMP of field-based team-sport athletes. The application of the SMP framework in practice could optimise the outcomes of training of field-based team-sport athletes by improving training specificity

    Data-Driven Framework for Understanding & Modeling Ride-Sourcing Transportation Systems

    Get PDF
    Ride-sourcing transportation services offered by transportation network companies (TNCs) like Uber and Lyft are disrupting the transportation landscape. The growing demand on these services, along with their potential short and long-term impacts on the environment, society, and infrastructure emphasize the need to further understand the ride-sourcing system. There were no sufficient data to fully understand the system and integrate it within regional multimodal transportation frameworks. This can be attributed to commercial and competition reasons, given the technology-enabled and innovative nature of the system. Recently, in 2019, the City of Chicago the released an extensive and complete ride-sourcing trip-level data for all trips made within the city since November 1, 2018. The data comprises the trip ends (pick-up and drop-off locations), trip timestamps, trip length and duration, fare including tipping amounts, and whether the trip was authorized to be shared (pooled) with another passenger or not. Therefore, the main goal of this dissertation is to develop a comprehensive data-driven framework to understand and model the system using this data from Chicago, in a reproducible and transferable fashion. Using data fusion approach, sociodemographic, economic, parking supply, transit availability and accessibility, built environment and crime data are collected from open sources to develop this framework. The framework is predicated on three pillars of analytics: (1) explorative and descriptive analytics, (2) diagnostic analytics, and (3) predictive analytics. The dissertation research framework also provides a guide on the key spatial and behavioral explanatory variables shaping the utility of the mode, driving the demand, and governing the interdependencies between the demand’s willingness to share and surge price. Thus, the key findings can be readily challenged, verified, and utilized in different geographies. In the explorative and descriptive analytics, the ride-sourcing system’s spatial and temporal dimensions of the system are analyzed to achieve two objectives: (1) explore, reveal, and assess the significance of spatial effects, i.e., spatial dependence and heterogeneity, in the system behavior, and (2) develop a behavioral market segmentation and trend mining of the willingness to share. This is linked to the diagnostic analytics layer, as the revealed spatial effects motivates the adoption of spatial econometric models to analytically identify the ride-sourcing system determinants. Multiple linear regression (MLR) is used as a benchmark model against spatial error model (SEM), spatially lagged X (SLX) model, and geographically weighted regression (GWR) model. Two innovative modeling constructs are introduced deal with the ride-sourcing system’s spatial effects and multicollinearity: (1) Calibrated Spatially Lagged X Ridge Model (CSLXR) and Calibrated Geographically Weighted Ridge Regression (CGWRR) in the diagnostic analytics layer. The identified determinants in the diagnostic analytics layer are then fed into the predictive analytics one to develop an interpretable machine learning (ML) modeling framework. The system’s annual average weekday origin-destination (AAWD OD) flow is modeled using the following state-of-the-art ML models: (1) Multilayer Perceptron (MLP) Regression, (2) Support Vector Machines Regression (SVR), and (3) Tree-based ensemble learning methods, i.e., Random Forest Regression (RFR) and Extreme Gradient Boosting (XGBoost). The innovative modeling construct of CGWRR developed in the diagnostic analytics is then validated in a predictive context and is found to outperform the state-of-the-art ML models in terms of testing score of 0.914, in comparison to 0.906 for XGBoost, 0.84 for RFR, 0.89 for SVR, and 0.86 for MLP. The CGWRR exhibits outperformance as well in terms of the root mean squared error (RMSE) and mean average error (MAE). The findings of this dissertation partially bridge the gap between the practice and the research on ride-sourcing transportation systems understanding and integration. The empirical findings made in the descriptive and explorative analytics can be further utilized by regional agencies to fill practice and policymaking gaps on regulating ride-sourcing services using corridor or cordon toll, optimally allocating standing areas to minimize deadheading, especially during off-peak periods, and promoting the ride-share willingness in disadvantage communities. The CGWRR provides a reliable modeling and simulation tool to researchers and practitioners to integrate the ride-sourcing system in multimodal transportation modeling frameworks, simulation testbed for testing long-range impacts of policies on ride-sourcing, like improved transit supply, congestions pricing, or increased parking rates, and to plan ahead for similar futuristic transportation modes, like the shared autonomous vehicles

    A spatiotemporal analysis of the impact of lockdown and coronavirus on London’s bicycle hire scheme: from response to recovery to a new normal

    Get PDF
    The coronavirus pandemic that started in 2019 has had wide-ranging impacts on many aspects of people’s daily lives. At the peak of the outbreak, lockdown measures and social distancing changed the ways in which cities function. In particular, they had profound impacts on urban transportation systems, with public transport being shut down in many cities. Bike share systems (BSS) were widely reported as having experienced an increase in demand during the early stages of the pandemic before returning to pre-pandemic levels. However, the studies published to date focus mainly on the first year of the pandemic, when various waves saw continual relaxing and reintroductions of restrictions. Therefore, they fall short of exploring the role of BSS as we move to the post-pandemic period. To address this gap, this study uses origin-destination (O-D) flow data from London’s Santander Cycle Hire Scheme from 2019–2021 to analyze the changing use of BSS throughout the first two years of the pandemic, from lockdown to recovery. A Gaussian mixture model (GMM) is used to cluster 2019 BSS trips into three distinct clusters based on their duration and distance. The clusters are used as a reference from which to measure spatial and temporal change in 2020 and 2021. In agreement with previous research, BSS usage was found to have declined by nearly 30% during the first lockdown. Usage then saw a sharp increase as restrictions were lifted, characterized by longer, less direct trips throughout the afternoon rather than typical peak commuting trips. Although the aggregate number of BSS trips appeared to return to normal by October 2020, this was against the backdrop of continuing restrictions on international travel and work from home orders. The period between July and December 2021 was the first period that all government restrictions were lifted. During this time, BSS trips reached higher levels than in 2019. Spatio-temporal analysis indicates a shift away from the traditional morning and evening peak to a more diffuse pattern of working hours. The results indicate that the pandemic may have had sustained impacts on travel behavior, leading to a “new normal” that reflects different ways of working
    • …
    corecore