15 research outputs found

Model of mobility demands for future short distance public transport systems

Author: Gómez Jorge Marx
Halberstadt Jantje
Sandau Alexander
Stamer Daniel
vom Berg Benjamin Wagner
Publication venue: AIS Electronic Library (AISeL)
Publication date: 01/01/2016

Short-distance public transport faces huge challenges, even though it is essential to reducing traffic emissions within a sustainable transport system. Revenues and subsidies are decreasing, and especially in rural regions the service offering is steadily shrinking. New approaches to public transport systems are urgently needed to avoid traffic gridlock in urban and rural areas and to guarantee a basic offer of mobility services for everyone. The proposed work introduces a demand-centered approach to dynamic public transport planning that relies on regional traffic data. The approach is based on a demand model represented as a dynamic undirected attributed graph. Demands are logged through traffic sensors and sustainability-focused traveler information systems
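
As a rough illustration of what a dynamic undirected attributed graph of demand could look like, the sketch below stores demand counts per hour on undirected edges between stops. The class name `DemandGraph`, its methods, the stop names, and the choice of hour-of-day as the time attribute are all assumptions made for this example; the abstract does not describe the authors' actual data model.

```python
from collections import defaultdict


class DemandGraph:
    """Undirected graph whose edges carry hour-indexed demand counts.

    Hypothetical structure for illustration only, not the paper's model.
    """

    def __init__(self):
        # Edge key: frozenset of the two stop names, so (a, b) == (b, a).
        # Edge value: mapping from hour of day to number of logged trips.
        self._edges = defaultdict(lambda: defaultdict(int))

    def log_demand(self, stop_a, stop_b, hour, trips=1):
        """Record `trips` logged trip demands between two distinct stops."""
        if stop_a == stop_b:
            raise ValueError("a demand edge needs two distinct stops")
        self._edges[frozenset((stop_a, stop_b))][hour] += trips

    def demand(self, stop_a, stop_b, hour=None):
        """Return demand for one hour, or the total over all hours."""
        key = frozenset((stop_a, stop_b))
        if key not in self._edges:
            return 0
        by_hour = self._edges[key]
        if hour is None:
            return sum(by_hour.values())
        return by_hour.get(hour, 0)


# Made-up sample data: the graph changes as new demands are logged.
graph = DemandGraph()
graph.log_demand("Station", "Market", hour=8, trips=12)
graph.log_demand("Market", "Station", hour=8, trips=3)
graph.log_demand("Station", "Market", hour=17, trips=5)
```

Because the edge key is an unordered pair, demand logged in either direction accumulates on the same edge, which is what makes the graph undirected in this sketch.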

AIS Electronic Library (AISeL)

Towards Building Real-Time, Convenient Route Recommendation System for Public Transit

Author: Agarwal Rachit
Bajaj Garvita
Bouloukakis Georgios
Georgantas Nikolaos
Issarny Valerie
Singh Pushpendra
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/09/2016

Public transportation is essential for sustainable and economical development of cities. Several transport organizations aim to provide service information to commuters through web and mobile apps. This information includes possible routes between two stations, estimated travel and arrival times, and real-time updates about traffic conditions. However, this information is currently not personalized according to commuter preferences. In this work, we emphasize the need for personalized transit service information to commuters and present a vision of our work in this direction. Our final goal is to develop a fully-functional personalized route recommendation system for public transit commuters. This involves identifying commuter preferences and suitable recommendation techniques, and developing a platform to communicate this information to the commuters. We identify the requirements for the development of this platform, and propose an architecture for our system. As a proof of concept, we present an Android participatory sensing application - MetroCognition, which acquires feedback on convenience experienced by commuters in public transit

Crossref

INRIA a CCSD electronic archive server

Reconstructing individual mobility from smart card transactions: A space alignment approach

Author: Fuzheng Zhang
Guangzhong Sun
Nicholas Jing Yuan
Xing Xie
Yingzi Wang
Publication date: 01/01/2013

Smart card transactions capture rich information about human mobility and urban dynamics, and are therefore of particular interest to urban planners and location-based service providers. However, since most transaction systems are designed only for billing purposes, fine-grained location information, such as the exact boarding and alighting stops of a bus trip, is typically only partially available or not available at all, which blocks deep exploitation of this rich and valuable data at the individual level. This paper presents a "space alignment" framework to reconstruct individual mobility history from a large-scale smart card transaction dataset pertaining to a metropolitan city. Specifically, we show that by carefully aligning the monetary space and geospatial space with the temporal space, we are able to extrapolate a series of critical domain-specific constraints. These constraints are then incorporated into a semi-supervised conditional random field to infer the exact boarding and alighting stops of all transit routes with surprisingly high accuracy: given only 10% of trips with known alighting/boarding stops, we successfully inferred more than 78% of alighting and boarding stops from all unlabeled trips. In addition, we demonstrated that the smart card data enriched by the proposed approach dramatically improved the performance of a conventional method for identifying users' home and work places (with 88% improvement on home detection and 35% improvement on work place detection). The proposed method offers the possibility of mining individual mobility from common public transit transactions, and showcases how uncertain data can be leveraged with domain knowledge and constraints to support cross-application data mining tasks

CiteSeerX

Determinants of continuance intention of user on smartphone-based traveller information systems in the greater Klang Valley

Author: Wan Rani Wan Suhaila
Publication date: 01/01/2019

The use of mobile traveller information services is pivotal to the efficient and effective running of an urban transportation system. The role of urban facilities managers in urban transportation planning is to develop a plan to provide drivers with real-time traveller information services to enable regional economic growth and transition. Existing research on mobile traveller information services has not deeply investigated the determinants of continuance intention to use smartphone-based traveller information systems (STIS). This study attempts to do so by investigating STIS users’ continuance intention at the post-adoption phase. It developed and validated an extended framework based on the expectation-confirmation model (ECM). A total of 280 STIS users from the Klang Valley highways and major streets participated in the study. The extended ECM results revealed that STIS users’ continuance intention is determined by perceived enjoyment and perceived usefulness of continued STIS use, followed by satisfaction with STIS use. Satisfaction and perceived usefulness are determined primarily by confirmation of expectations from participants’ previous use, except for perceived enjoyment. The findings have implications for the transportation sector in planning strategies to increase users’ continuance intention to use STIS services. Most of the current literature on mobile information services has focused only on pre-adoption and has paid little attention to users’ continuance intention, especially in the context of smartphone apps or electronic information in transportation system services. This study fills the theoretical and practical gaps by focusing on the post-adoption phase and developing an extended ECM-based framework to explain STIS continuance intention. In addition, the topic is timely, as mobile information services have been flourishing in the transportation sector worldwide

Universiti Teknologi Malaysia Institutional Repository

Prioritization of effective variables in the smart marine fleet for short-distance voyages in order to improve managerial and competitive performance

Author: Berg Sharon
Nourafshan Farhad
Publication venue: Høgskolen i Molde - Vitenskapelig høgskole i logistikk
Publication date: 01/01/2023

Brage HiM

Mining Public Transport Usage for Personalised Intelligent Transport Systems

Author: Capra L
Froehlich J
Lathia N
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 21/01/2011

Traveller information, route planning, and service updates have become essential components of public transport systems: they help people navigate built environments by providing access to information regarding delays and service disruptions. However, one aspect that these systems lack is a way of tailoring the information they offer in order to provide personalised trip time estimates and relevant notifications to each traveller. Mining each user's travel history, collected by automated ticketing systems, has the potential to address this gap. In this work, we analyse one such dataset of travel history on the London underground. We then propose and evaluate methods to (a) predict personalised trip times for the system users and (b) rank stations based on future mobility patterns, in order to identify the subset of stations that are of greatest interest to the user and thus provide useful travel updates. © 2010 IEEE

Crossref

UCL Discovery

Optimising the Loading Diversity of Rail Passenger Crowding using On-Board Occupancy Data

Author: Ball Simon David
Publication date: 14/12/2016

Crowded conditions on trains can lead to lower passenger satisfaction, discourage rail travel, result in negative economic impacts and are a factor in a number of health and safety hazards. In the UK there is an annual survey of rail passenger crowding, although the measures used do not reflect coach-by-coach variations, nor do they reflect variations across the peak period. In this MPhil thesis I investigated the application of weight-based automatic passenger counting data to deliver more even loadings on trains through the provision of new real-time and static solutions. In addition I investigated the potential benefits of such solutions in terms of reduced dwell times and reduced crowding. The overall concept proposed was to make the most of the existing available capacity; for example, so that no-one is standing when seats are available. Through analysing a large sample of air suspension data, I identified station-specific trends where some coaches were over capacity while others had spare capacity. I also conducted a critical review of academic research into on-train crowding and solutions that seek to optimise ‘loading diversity’. This study contributes to this emerging subject area in several ways: I propose two new metrics to describe inter-coach loading diversity that, unlike existing metrics, contain information relative to the capacity; I have revealed a link between the inter-coach loading diversity metrics and estimated boarding times, with trains classified as ‘very uneven’ on departure typically having dwell times of approximately five to ten seconds greater than services that were classified as being ‘even’ with a similar total number of passengers on board; and finally I have applied classification supervised learning techniques to predict the load factor for a given service and these predictors were an improvement over taking the historical average

Open Research Online (The Open University)

Forestogram: Biclustering Visualization Framework with Applications in Public Transport and Bioinformatics

Author: Ghaemi Mohammad Sajjad
Publication date: 01/12/2017

RÉSUMÉ (translated from French): In many data analysis problems, the data are expressed as a matrix with subjects in rows and attributes in columns. Traditional clustering methods aim to group the subjects (rows) according to similarity criteria between them. The goal is to form groups of subjects (rows) that share a certain degree of resemblance. The resulting groups guarantee that the subjects share similarities in their attributes (columns), but there is no guarantee about what happens at the level of the attributes (columns) themselves. In some applications, a simultaneous grouping of the rows and columns, called biclustering of the data matrix, may be desired. To this end, we design and develop a new framework called Forestogram, which computes this simultaneous grouping of rows and columns (biclusters) in a hierarchical fashion. Grouping rows and columns simultaneously and hierarchically can help practitioners better understand how the groups evolve, with interesting theoretical properties. Forestogram, the proposed computation and visualization tool, can be regarded as a 3D extension of the dendrogram, with an extended orthogonal merge. Each bicluster consists of a group of rows (or subjects) that exhibits a pattern strongly correlated with the corresponding group of columns (or attributes). However, instead of performing two-way clustering independently on each side, we propose a hierarchical biclustering algorithm that takes rows and columns at the same time to determine the biclusters. In addition, we develop a model-based information criterion that provides an estimated number of biclusters across a set of hierarchical configurations within the forestogram under mild assumptions.
We study the suggested framework from two different applied perspectives, one in public transit and the other in bioinformatics. First, we study user behaviour in public transit from two distinct kinds of information, temporal data and spatial coordinates, collected from users' smart card transaction data. In many cities around the world, public transit companies use a smart card system to manage fare collection. Analysing this information provides a comprehensive view of the user's influence in the interactive public transit network. In this regard, the analysis of temporal data describing the time of entry into the public transit network is considered the most important component of the data collected from smart cards. Classical distance-based clustering techniques are not suitable for analysing temporal data. A new intuitive projection is suggested to preserve the pattern of time-stamped data. It is introduced into the suggested method to discover users' temporal behaviour patterns. This projection preserves the temporal distance between any arbitrary pair of time-stamped data points with a meaningful visualization. Consequently, this information is fed into a hierarchical clustering algorithm as a data segmentation method to discover user patterns. Then, the time of use is taken into account as a latent variable to make the Euclidean metric appropriate for extracting the spatial pattern through our forestogram.
As a second application, the forestogram is tested on a multiomics dataset combined from different biological measurements to study how patients' health status and the corresponding biological modalities evolve hierarchically over the term of pregnancy, in each bicluster. The maintenance of pregnancy relies on a finely tuned balance between tolerance to the fetal allograft and protective mechanisms against invading pathogens. Despite the well-established impact of development during the first months of pregnancy on long-term outcomes, the interactions between the various biological mechanisms that govern the progression of pregnancy have not been studied in detail. Demonstrating the chronology of these adaptations to term pregnancy provides the framework for future studies examining the deviations implicated in pregnancy-related pathologies, including preterm birth and preeclampsia. We perform a multiomics analysis of 51 samples from 17 pregnant women delivering at term. The datasets include measurements of the immunome, transcriptome, microbiome, proteome, and metabolome of samples obtained simultaneously from the same patients. Multivariate predictive modelling using the Elastic Net algorithm is used to measure the ability of each dataset to predict gestational age. Using stacked generalization, these datasets are combined into a single model. This model not only significantly increases predictive power by combining all datasets, but also reveals new interactions between different biological modalities.
In addition, our suggested forestogram is another guideline, alongside gestational age at the time of sampling, that provides an unsupervised model to show how much supervised information is needed in each trimester to characterize pregnancy-induced changes in microbiome, transcriptome, genome, exposome, and immunome responses effectively.
ABSTRACT: In many statistical modeling problems, data are expressed in a matrix with subjects in rows and attributes in columns. In this regard, simultaneous grouping of rows and columns, known as biclustering of the data matrix, is desired. We design and develop a new framework called Forestogram, with the aim of fast computation and hierarchical illustration of biclusters. Often in practical data analysis, we deal with a two-dimensional object known as the data matrix, where observations are expressed as samples (or subjects) in rows, and attributes (or features) in columns. Thus, simultaneous grouping of rows and columns in a hierarchical manner helps practitioners better understand how clusters evolve. Forestogram, a novel computational and visualization tool, could be thought of as a 3D expansion of the dendrogram, with an extended orthogonal merge. Each bicluster consists of a group of rows (or samples) that unfolds a highly correlated schema with their corresponding group of columns (or attributes). However, instead of performing two-way clustering independently on each side, we propose a hierarchical biclustering algorithm which takes rows and columns at the same time to determine the biclusters. Furthermore, we develop a model-based information criterion which provides an estimated number of biclusters through a set of hierarchical configurations within the forestogram under mild assumptions. We study the suggested framework from two different applied perspectives, one in the public transit domain and the other in the bioinformatics field.
First, we investigate users’ behavior in public transit based on two distinct kinds of information, temporal data and spatial coordinates gathered from smart cards. In many cities worldwide, public transit companies use a smart card system to manage fare collection. Analysis of this information provides comprehensive insight into the user’s influence in the interactive public transit network. In this regard, analysis of temporal data describing the time of entry into the public transit network is considered the most substantial component of the data gathered from the smart cards. Classical distance-based techniques are not always suitable for analyzing this time series data. A novel projection with an intuitive visual map from a higher dimension into a three-dimensional clock-like space is suggested to reveal the underlying temporal pattern of public transit users. This projection retains the temporal distance between any arbitrary pair of time-stamped data with meaningful visualization. Consequently, this information is fed into a hierarchical clustering algorithm as a method of data segmentation to discover the pattern of users. Then, the time of usage is taken into account as a latent variable to make the Euclidean metric appropriate for extracting the spatial pattern through our forestogram. As a second application, forestogram is tested on a multiomics dataset combined from different biological measurements to study how patients and corresponding biological modalities evolve hierarchically in each bicluster over the term of pregnancy. The maintenance of pregnancy relies on a finely tuned balance between tolerance to the fetal allograft and protective mechanisms against invading pathogens. Despite the well-established impact of development during the early months of pregnancy on long-term outcomes, the interactions between various biological mechanisms that govern the progression of pregnancy have not been studied in detail.
Demonstrating the chronology of these adaptations to term pregnancy provides the framework for future studies examining deviations implicated in pregnancy-related pathologies including preterm birth and preeclampsia. We perform a multiomics analysis of 51 samples from 17 pregnant women, delivering at term. The datasets include measurements from the immunome, transcriptome, microbiome, proteome, and metabolome of samples obtained simultaneously from the same patients. Multivariate predictive modeling using the Elastic Net algorithm is used to measure the ability of each dataset to predict gestational age. Using stacked generalization, these datasets are combined into a single model. This model not only significantly increases the predictive power by combining all datasets, but also reveals novel interactions between different biological modalities. Furthermore, our suggested forestogram is another guideline along with the gestational age at time of sampling that provides an unsupervised model to show how much supervised information is necessary for each trimester to characterize the pregnancy-induced changes in Microbiome, Transcriptome, Genome, Exposome, and Immunome responses effectively
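
The abstract above notes that plain distance-based techniques handle time-of-day data poorly and proposes a clock-like projection. The thesis's three-dimensional projection is not specified here, so the following is only a sketch of the common two-dimensional circular encoding, which shows the underlying idea: mapping an hour onto a clock face keeps 23:30 and 00:30 close together under the Euclidean metric. The function names `clock_point` and `clock_distance` are hypothetical.

```python
import math


def clock_point(hour):
    """Map a time of day in hours (0 to 24) to a point on the unit circle."""
    angle = 2 * math.pi * (hour % 24) / 24
    return (math.cos(angle), math.sin(angle))


def clock_distance(hour_a, hour_b):
    """Euclidean (chord) distance between two times on the clock face."""
    xa, ya = clock_point(hour_a)
    xb, yb = clock_point(hour_b)
    return math.hypot(xa - xb, ya - yb)


# Comparing raw hour values would put 23.5 and 0.5 a full 23 hours apart;
# on the clock face they are one hour apart, closer than 8:00 and 12:00.
late_vs_early = clock_distance(23.5, 0.5)
morning_vs_noon = clock_distance(8, 12)
```

Points encoded this way can be passed to any Euclidean clustering routine without special handling of the midnight wrap-around.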

PolyPublie

Méthodes spatio-temporelles de fouilles des données de cartes à puce en transport urbain

Author: He Li
Publication date: 01/06/2019

RÉSUMÉ (translated from French): Public transit smart card data are useful for understanding the behaviour of transit network users. Much relevant research has already been carried out on: (1) the use of smart card data, (2) data mining techniques and (3) the use of data mining with smart card data. In this research, the classification of user behaviour is based on trips for which the temporal and spatial classifications are treated as separate processes. Our research partners expressed the wish to be able to examine user behaviour by considering the spatial and temporal dimensions simultaneously. In this thesis, we develop methods, based on users' daily behaviour, that take both spatial and temporal behaviour into account. The methodology developed to classify the behaviour of smart card users relies on the cross correlation distance (CCD) method, dynamic time warping (DTW), hierarchical classification and sampling. In addition, a density-based method is also addressed. This thesis consists of four articles plus further results presented in a separate chapter: (1) To begin the temporal classification, CCD and DTW are compared in order to choose the better metric and to develop a time series classification method using hierarchical classification; CCD proved better in this case. With this proposed method, a portion of the temporal behaviours can be classified.
(2) To carry out temporal classification on massive data, a sampling method for handling the large volumes of data from public transit smart card systems is proposed, together with a calibration indicator for this method. The sampling method allows us to classify all the temporal behaviours of users in a public transit network, and the indicator allows us to choose the best parameters for the algorithm. (3) To group the spatial and spatio-temporal behaviours of public transit users, spatial and spatio-temporal classification methods for user behaviour are developed by adapting the DTW algorithm, along with methods for visualizing the results using a 3-dimensional spatio-temporal graph, in order to show the effectiveness of the algorithm. The visualization of the results shows the effectiveness of these two methods. (4) To test whether the classification method developed in one city applies in another, we develop a method for recognizing and comparing behaviours in two cities, one in Canada and one in Chile. The results show that about 66% of temporal behaviours can be recognized given a one-day transaction profile, and the recognition accuracy is about 70%. (5) To analyse the results of the spatial and spatio-temporal classifications in more depth, analyses are carried out including the share of metro use and the mean and deviation of the space-time trajectory, among others, and these analyses allow us to identify the differences in demand between the groups obtained. (6) In addition, density-based methods for classifying geographic zones are developed to measure changes in user behaviour.
To test these methods, massive data from the automatic fare collection systems of the Société de Transport l’Outaouais (STO) in Gatineau and of TranSantiago in Santiago (Chile) are used. As for implementation, the proposed methods are programmed in Python. The results of the methods not only make it possible to group public transit user profiles into a few groups and to better understand the characteristics of each, but also to develop a series of visualization methods with which the data can be processed automatically to generate graphs. With these graphs, public transit authorities can translate automatically collected data to illustrate transport demand. The researchers therefore hope that these contributions will help authorities plan public transit so as to better meet citizens' demands.
ABSTRACT: Transit smart card data is useful for understanding the behavior of transit users. Much relevant research has been conducted on: (1) the use of smart card data, (2) data mining techniques and (3) the use of data mining with smart card data. In this research, the classification of user behavior is based on travel in which temporal and spatial classifications are considered as separate processes. We develop methods, based on the daily behaviors of users, taking into account both spatial and temporal behaviors. The methodology developed to classify the behavior of smart card users is based on the cross correlation distance (CCD) method, dynamic time warping (DTW), hierarchical classification and a sampling method. In addition, a density-based method is also addressed.
This thesis is presented as four articles plus other results in a separate chapter: (1) To start the temporal classification, CCD and DTW are compared in order to choose the better metric and to develop a method for classifying time series using hierarchical classification. CCD proved better in this case. A portion of the temporal behaviors can be classified with this proposed method. (2) To achieve temporal classification for Big Data, a sampling method for processing large volumes of data from transit smart card systems and a calibration indicator for this method are proposed. This sampling method allows us to classify all the users’ temporal behaviors in a public transport network, and this indicator allows us to choose the best parameters in the algorithm. (3) To classify the spatial and spatio-temporal behavior of users in public transport, methods of spatial and spatio-temporal classification of user behaviors are developed by adjusting the DTW algorithm, and a method for visualizing the results with a 3-dimensional spatio-temporal graph is also developed, to show the efficiency of the algorithm. The visualization of the results shows the effectiveness of these two methods. (4) To test whether the classification method developed in one city applies in another city, we develop a method to recognize and compare the behaviors of two cities, one in Canada and one in Chile. The results show that about 66% of temporal behaviors can be recognized given one-day transaction profiles of two cities, and the recognition accuracy is about 70%. (5) For a deeper view of the spatio-temporal classification results, analyses are made including the proportion of metro utilisation and the mean and deviation of the space-time trajectory, among others, and these analyses allow us to identify the differences in demand between the clusters obtained.
(6) In addition, density-based geographic classification methods for measuring the change in user behavior are developed. To test these methods, massive data from the automated collection systems of the Société de Transport l’Outaouais (STO) and of TranSantiago in Santiago de Chile are used. Regarding the implementation, the proposed methods are programmed in Python. The results of these methods not only allow the profiles of transit users to be grouped into a few groups and the characteristics of each to be better understood, but also yield a series of visualization approaches with which data can be transferred directly to graphs. With these graphs, transit authorities can translate automatically collected data into traveler demand. As a result, the researchers hope that these contributions will help the authorities plan public transit to better meet the demands of citizens
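
Dynamic time warping (DTW), one of the two metrics compared in the thesis above, can be sketched with the textbook dynamic-programming recurrence. This is the classic unconstrained formulation with absolute difference as the local cost, not the adjusted DTW variant the thesis develops; the function name `dtw_distance` and the sample sequences are assumptions for illustration.

```python
def dtw_distance(series_a, series_b):
    """Classic DTW distance between two numeric sequences.

    cost[i][j] holds the cheapest cumulative cost of aligning the first i
    items of series_a with the first j items of series_b.
    """
    n, m = len(series_a), len(series_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(series_a[i - 1] - series_b[j - 1])
            # Extend the cheapest of: repeat b's item, repeat a's item,
            # or advance both sequences together.
            cost[i][j] = step + min(
                cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1]
            )
    return cost[n][m]


# Made-up hourly boarding counts for two card holders: the second profile
# is the first one with its peak stretched over an extra hour.
profile_a = [0, 1, 2, 0]
profile_b = [0, 1, 2, 2, 0]
stretched = dtw_distance(profile_a, profile_b)
```

Because warping lets one sample align with several, a profile and a time-stretched copy of it end up at distance zero, which is the property that makes DTW attractive for comparing travel patterns that differ only in timing.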

PolyPublie