26 research outputs found

    High accuracy context recovery using clustering mechanisms

    Get PDF
    This paper examines the recovery of user context in indoor environmnents with existing wireless infrastructures to enable assistive systems. We present a novel approach to the extraction of user context, casting the problem of context recovery as an unsupervised, clustering problem. A well known density-based clustering technique, DBSCAN, is adapted to recover user context that includes user motion state, and significant places the user visits from WiFi observations consisting of access point id and signal strength. Furthermore, user rhythms or sequences of places the user visits periodically are derived from the above low level contexts by employing state-of-the-art probabilistic clustering technique, the Latent Dirichiet Allocation (LDA), to enable a variety of application services. Experimental results with real data are presented to validate the proposed unsupervised learning approach and demonstrate its applicability.<br /

    An Unsupervised and Non-Invasive Model for Predicting Network Resource Demands

    Get PDF
    During the last decade, network providers are faced by a growing problem regarding the distribution of bandwidth and computing resources. Recently, the mobile edge computing paradigm was proposed as a possible solution, mainly in consideration of the provided possibility of transferring service demands at the edge of the network. This solution heavily relies on the dynamic allocation of resources, depending on the user needs and network connection, therefore it becomes essential to correctly predict user movements and activities. This paper proposes an unsupervised methodology to define meaningful user locations from noninvasive user information, captured by the user terminal with no computing or battery overhead. The data is analyzed through a conjoined clustering algorithm to build a stochastic Markov chain to predict the users’ movements and their bandwidth demands. Such a model could be used by network operators to optimize network resources allocation. To evaluate the proposed methodology, we tested it on one of the largest public community’s labeled mobile and sensor dataset, developed by the “CrowdSignals.io” initiative, and we present positive and promising results concerning the prediction capabilities of the model

    Detection of meaningful locations from passive mobile positioning data using location profiling

    Get PDF
    Mobile positioning data is a promising source for investigating people’s activity patterns. People regularly visit locations that have different functions to them. Locations with similar activity patterns can be distinguished from the data based on people’s calling activities. The problem with assigning meaning to these locations in the data is limited information about the person and access to ground truth data. The thesis proposes a method to profile locations and assign meanings to differently behaving location groups. In the course of the work, various features are added to the location points by means of which they are classified. Additionally, an expert’s opinion was considered to provide input for the classes

    Data from mobile phone operators: A tool for smarter cities?

    Get PDF
    Abstract The use of mobile phone data provides new spatio-temporal tools for improving urban planning, and for reducing inefficiencies in present-day urban systems. Data from mobile phones, originally intended as a communication tool, are increasingly used as innovative tools in geography and social sciences research. Empirical studies on complex city systems from human-centred and urban dynamics perspectives provide new insights to develop promising applications for supporting smart city initiatives. This paper provides a comprehensive review and a typology of spatial studies on mobile phone data, and highlights the applicability of such digital data to develop innovative applications for enhanced urban management

    Detecting Home Locations from CDR Data: Introducing Spatial Uncertainty to the State-of-the-Art

    Get PDF
    Non-continuous location traces inferred from Call Detail Records (CDR) at population scale are increasingly becoming available for research and show great potential for automated detection of meaningful places. Yet, a majority of Home Detection Algorithms (HDAs) suffer from “blind” deployment of criteria to define homes and from limited possibilities for validation. In this paper, we investigate the performance and capabilities of five popular criteria for home detection based on a very large mobile phone dataset from France (~18 million users, 6 months). Furthermore, we construct a data-driven framework to assess the spatial uncertainty related to the application of HDAs. Our findings appropriate spatial uncertainty in HDA and, in extension, for detection of meaningful places. We show how spatial uncertainties on the individuals’ level can be assessed in absence of ground truth annotation, how they relate to traditional, high-level validation practices and how they can be used to improve results for, e.g., nation-wide population estimation

    MOBILITY AND ACTIVITY SPACE: UNDERSTANDING HUMAN DYNAMICS FROM MOBILE PHONE LOCATION DATA

    Get PDF
    Studying human mobility patterns and people’s use of space has been a major focus in geographic research for ages. Recent advancements of location-aware technologies have produced large collections of individual tracking datasets. Mobile phone location data, as one of the many emerging data sources, provide new opportunities to understand how people move around at a relatively low cost and unprecedented scale. However, the increasing data volume, issue of data sparsity, and lack of supplementary information introduce additional challenges when such data are used for human behavioral research. Effective analytical methods are needed to meet the challenges to gain an improved understanding of individual mobility and collective behavioral patterns. This dissertation proposes several approaches for analyzing two types of mobile phone location data (Call Detail Records and Actively Tracked Mobile Phone Location Data) to uncover important characteristics of human mobility patterns and activity spaces. First, it introduces a home-based approach to understanding the spatial extent of individual activity space and the geographic patterns of aggregate activity space characteristics. Second, this study proposes an analytical framework which is capable of examining multiple determinants of individual activity space simultaneously. Third, the study introduces an anchor-point based trajectory segmentation method to uncover potential demand of bicycle trips in a city. The major contributions of this dissertation include: (1) introducing an activity space measure that can be used to evaluate how individuals use urban space around where they live; (2) proposing an analytical framework with three individual mobility indicators that can be used to summarize and compare human activity spaces systematically across different population groups or geographic regions; (3) developing analytical methods for uncovering the spatiotemporal dynamics of travel demand that can be potentially served by bicycles in a city, and providing suggestions for the locations and daily operation of bike sharing stations

    Learning from Structured Data with High Dimensional Structured Input and Output Domain

    Get PDF
    Structured data is accumulated rapidly in many applications, e.g. Bioinformatics, Cheminformatics, social network analysis, natural language processing and text mining. Designing and analyzing algorithms for handling these large collections of structured data has received significant interests in data mining and machine learning communities, both in the input and output domain. However, it is nontrivial to adopt traditional machine learning algorithms, e.g. SVM, linear regression to structured data. For one thing, the structural information in the input domain and output domain is ignored if applying the normal algorithms to structured data. For another, the major challenge in learning from many high-dimensional structured data is that input/output domain can contain tens of thousands even larger number of features and labels. With the high dimensional structured input space and/or structured output space, learning a low dimensional and consistent structured predictive function is important for both robustness and interpretability of the model. In this dissertation, we will present a few machine learning models that learn from the data with structured input features and structured output tasks. For learning from the data with structured input features, I have developed structured sparse boosting for graph classification, structured joint sparse PCA for anomaly detection and localization. Besides learning from structured input, I also investigated the interplay between structured input and output under the context of multi-task learning. In particular, I designed a multi-task learning algorithms that performs structured feature selection & task relationship Inference. We will demonstrate the applications of these structured models on subgraph based graph classification, networked data stream anomaly detection/localization, multiple cancer type prediction, neuron activity prediction and social behavior prediction. Finally, through my intern work at IBM T.J. Watson Research, I will demonstrate how to leverage structural information from mobile data (e.g. call detail record and GPS data) to derive important places from people's daily life for transit optimization and urban planning
    corecore