5,536 research outputs found

    Visualizing the dynamics of London's bicycle hire scheme

    Get PDF
    Visualizing flows between origins and destinations can be straightforward when dealing with small numbers of journeys or simple geographies. Representing flows as lines embedded in geographic space has commonly been used to map transport flows, especially when geographic patterns are important as they are when characterising cities or managing transportation. However, for larger numbers of flows, this approach requires careful design to avoid problems of occlusion, salience bias and information overload. Driven by the requirements identified by users and managers of the London Bicycle Hire scheme we present three methods of representation of bicycle hire use and travel patterns. Flow maps with curved flow symbols are used to show overviews in flow structures. Gridded views of docking station location that preserve geographic relationships are used to explore docking station status over space and time in a graphically efficient manner. Origin-Destination maps that visualise the OD matrix directly while maintaining geographic context are used to provide visual details on demand. We use these approaches to identify changes in travel behaviour over space and time, to aid station rebalancing and to provide a framework for incorporating travel modelling and simulation

    A Generalisable Data Fusion Framework to Infer Mode of Transport Using Mobile Phone Data

    Full text link
    Cities often lack up-to-date data analytics to evaluate and implement transport planning interventions to achieve sustainability goals, as traditional data sources are expensive, infrequent, and suffer from data latency. Mobile phone data provide an inexpensive source of geospatial information to capture human mobility at unprecedented geographic and temporal granularity. This paper proposes a method to estimate updated mode of transportation usage in a city, with novel usage of mobile phone application traces to infer previously hard to detect modes, such as bikes and ride-hailing/taxi. By using data fusion and matrix factorisation, we integrate socioeconomic and demographic attributes of the local resident population into the model. We tested the method in a case study of Santiago (Chile), and found that changes from 2012 to 2020 in mode of transportation inferred by the method are coherent with expectations from domain knowledge and the literature, such as ride-hailing trips replacing mass transport.Comment: 19 pages, 8 figure

    Predictive trend mining for social network analysis

    Get PDF
    This thesis describes research work within the theme of trend mining as applied to social network data. Trend mining is a type of temporal data mining that provides observation into how information changes over time. In the context of the work described in this thesis the focus is on how information contained in social networks changes with time. The work described proposes a number of data mining based techniques directed at mechanisms to not only detect change, but also support the analysis of change, with respect to social network data. To this end a trend mining framework is proposed to act as a vehicle for evaluating the ideas presented in this thesis. The framework is called the Predictive Trend Mining Framework (PTMF). It is designed to support "end-to-end" social network trend mining and analysis. The work described in this thesis is divided into two elements: Frequent Pattern Trend Analysis (FPTA) and Prediction Modeling (PM). For evaluation purposes three social network datasets have been considered: Great Britain Cattle Movement, Deeside Insurance and Malaysian Armed Forces Logistic Cargo. The evaluation indicates that a sound mechanism for identifying and analysing trends, and for using this trend knowledge for prediction purposes, has been established

    Estimating poverty maps from aggregated mobile communication networks

    Get PDF
    Governments and other organisations often rely on data collected by household surveys and censuses to provide estimates of household poverty and identify areas in most need of regeneration and development investment. However, due to the high cost associated with manual data collection and processing, many developing countries conduct such surveys very infrequently, if at all, and only at a coarse level of spatial granularity. Consequently, it becomes difficult for governments and NGOs to determine where and when to intervene. This thesis addresses this problem by examining the feasibility of deriving up to date and high resolution proxy measurements of poverty from an alternative source of data, namely, Call Detail Records (CDRs), which can be used by organisations to help in decision making. Specifically, we contribute the following: 1. A detailed spatial analysis of economic wealth in two sub-Saharan countries, Senegal and Cote d’Ivoire from which we derive two baseline poverty esti- ˆ mators grounded on concrete usage scenarios. 2. We establish a link between communication patterns and wealth through a simulation-based analysis of information diffusion. We further examine the influence of contextual factors, including data quality issues and economic volatility, on the strength of this relationship. 3. An approach to building wealth prediction models based on features of aggregated CDRs. Features include static and simulation based measures of information access, activity based metrics and econometric inspired metrics. We further perform a comparative analysis of the results of several models in relation to the baseline predictors. We conclude that it is possible to produce proxy poverty or wealth indicators from aggregated CDRs that provide a good level of accuracy, particularly where geographical coverage of the mobile phone network is sufficient. The final outcome of this thesis is a method for developing aggregated CDR-based poverty or wealth models that can be readily implemented anywhere in which there is a need for more up to date and/or finer resolution poverty estimates

    Programmable Insight: A Computational Methodology to Explore Online News Use of Frames

    Get PDF
    abstract: The Internet is a major source of online news content. Online news is a form of large-scale narrative text with rich, complex contents that embed deep meanings (facts, strategic communication frames, and biases) for shaping and transitioning standards, values, attitudes, and beliefs of the masses. Currently, this body of narrative text remains untapped due—in large part—to human limitations. The human ability to comprehend rich text and extract hidden meanings is far superior to known computational algorithms but remains unscalable. In this research, computational treatment is given to online news framing for exposing a deeper level of expressivity coined “double subjectivity” as characterized by its cumulative amplification effects. A visual language is offered for extracting spatial and temporal dynamics of double subjectivity that may give insight into social influence about critical issues, such as environmental, economic, or political discourse. This research offers benefits of 1) scalability for processing hidden meanings in big data and 2) visibility of the entire network dynamics over time and space to give users insight into the current status and future trends of mass communication.Dissertation/ThesisDoctoral Dissertation Computer Science 201

    Counting equivalence classes of Boolean functions

    Get PDF
    У овој дисертацији разматран јe проблем израчунавања броја класа еквиваленције Булових функција. Тежина одређивања броја класа еквивален- ције нагло расте са бројем променљивих n. Мотивација за избор ове теме лежи у чињеници да су конкретни бројеви до сада били познати само за релативно мале вредности n, иако је сам проблем теоријски одавно решен...In this dissertation, the problem of calculating the number of equiva- lence classes of Boolean functions is discussed. The difficulty of determining the number of equivalence classes increases sharply with the number of variables n. The motivation for choosing this topic lies in the fact that concrete numbers have been known so far only for relatively small values of n, although the problem itself was theoretically solved a long time ago..

    Analysis of Family-Health-Related Topics on Wikipedia

    Get PDF
    New concepts, terms, and topics always emerge; and meanings of existing terms and topics keep changing all the time. These phenomena occur more frequently on social media than on conventional media because social media allows a huge number of users to generate information online. Retrieving relevant results in different time periods of a fast-changing topic becomes one of the most difficult challenges in the information retrieval field. Among numerous topics discussed on social media, health-related topics are a major category which attracts increasing attention from the general public. This study investigated and explored the evolution patterns of family-health-related topics on Wikipedia. Three family-health-related topics (Child Maltreatment, Family Planning, and Women’s Health) were selected from the World Health Organization Website and their associated entries were retrieved on Wikipedia. Historical numeric and text data of the entries from 2010 to 2017 were collected from a Wikipedia data dump and the Wikipedia Web pages. Four periods were defined: 2010 to 2011, 2012 to 2013, 2014 to 2015, and 2016 to 2017. Coding, subject analysis, descriptive statistical analysis, inferential statistical analysis, SOM approach, and n-gram approach were employed to explore the internal characteristics and external popularity evolutions of the topics. The findings illustrate that the external popularities of the family-health-related topics declined from 2010 to 2017, although their content on Wikipedia kept increasing. The emerged entries had three features: specialization, summarization, and internationalization. The subjects derived from the entries became increasingly diverse during the investigated periods. Meanwhile, the developing trajectories of the subjects varied from one to another. According to the developing trajectories, the subjects were grouped into three categories: growing subject, diminishing subject, and fluctuating subject. The popularities of the topics among the Wikipedia viewers were consistent, while among the editors were not. For each topic, its popularity trend among the editors and the viewers was inconsistent. Child Maltreatment was the most popular among the three topics, Women’s Health was the second most popular, while Family Planning was the least popular among the three. The implications of this study include: (1) helping health professionals and general users get a more comprehensive understanding of the investigated topics; (2) contributing to the developments of health ontologies and consumer health vocabularies; (3) assisting Website designers in organizing online health information and helping them identify popular family-health-related topics; (4) providing a new approach for query recommendation in information retrieval systems; (5) supporting temporal information retrieval by presenting the temporal changes of family-health-related topics; and (6) providing a new combination of data collection and analysis methods for researchers
    corecore