32 research outputs found

    Scaling of city attractiveness for foreign visitors through big data of human economical and social media activity

    Full text link
    Scientific studies investigating laws and regularities of human behavior are nowadays increasingly relying on the wealth of widely available digital information produced by human social activity. In this paper we leverage big data created by three different aspects of human activity (i.e., bank card transactions, geotagged photographs and tweets) in Spain for quantifying city attractiveness for the foreign visitors. An important finding of this papers is a strong superlinear scaling of city attractiveness with its population size. The observed scaling exponent stays nearly the same for different ways of defining cities and for different data sources, emphasizing the robustness of our finding. Temporal variation of the scaling exponent is also considered in order to reveal seasonal patterns in the attractivenessComment: 8 pages, 3 figures, 1 tabl

    Sublinear scaling of country attractiveness observed from Flickr dataset

    Get PDF
    The number of people who decide to share their photographs publicly increases every day, consequently making available new almost real-time insights of human behavior while traveling. Rather than having this statistic once a month or yearly, urban planners and touristic workers now can make decisions almost simultaneously with the emergence of new events. Moreover, these datasets can be used not only to compare how popular different touristic places are, but also predict how popular they should be taking into an account their characteristics. In this paper we investigate how country attractiveness scales with its population and size using number of foreign users taking photographs, which is observed from Flickr dataset, as a proxy for attractiveness. The results showed two things: to a certain extent country attractiveness scales with population, but does not with its size; and unlike in case of Spanish cities, country attractiveness scales sublinearly with population, and not superlinearly.Singapore-MIT Alliance for Research and Technology (SMART)Accenture (Firm)Air liquide (Firm)Coca-Cola CompanyEricsson (Firm)Volkswagen Electronics Research LabUber (Firm)MIT Senseable City Lab Consortiu

    Uncovering Urban Temporal Patterns from Geo-Tagged Photography

    Get PDF
    We live in a world where digital trails of different forms of human activities compose big urban data, allowing us to detect many aspects of how people experience the city in which they live or come to visit. In this study we propose to enhance urban planning by taking into a consideration individual preferences using information from an unconventional big data source: dataset of geo-tagged photographs that people take in cities which we then use as a measure of urban attractiveness. We discover and compare a temporal behavior of residents and visitors in ten most photographed cities in the world. Looking at the periodicity in urban attractiveness, the results show that the strongest periodic patterns for visitors are usually weekly or monthly. Moreover, by dividing cities into two groups based on which continent they belong to (i.e., North America or Europe), it can be concluded that unlike European cities, behavior of visitors in the US cities in general is similar to the behavior of their residents. Finally, we apply two indices, called “dilatation attractiveness index” and “dilatation index”, to our dataset which tell us the spatial and temporal attractiveness pulsations in the city. The proposed methodology is not only important for urban planning, but also does support various business and public stakeholder decision processes, concentrated for example around the question how to attract more visitors to the city or estimate the impact of special events organized there.Singapore-MIT Alliance for Research and Technology (SMART)Center for Complex Engineering Systems (CCES) at KACST and MITCoca-Cola CompanyAccenture (Firm

    Spatiotemporal Variation in Bicycle Road Crashes and Traffic Volume in Berlin: Implications for Future Research, Planning, and Network Design

    No full text
    Urban bicycling has been largely marginalized for decades in the global north and south. Despite a renaissance over the last two decades in academic research, political discourse, sustainability activism, and planning, cities often struggle with data quality and quantity. Digitalization has led to more and better data sources, but they still must be validated and compared with findings from conventional travel surveys. With the COVID-19 pandemic, bicycling and associated road facilities expanded, as did road crashes involving bicycles. This study utilized tens of thousands of datapoints sourced by public institutions and digital devices belonging to private companies that have spread across Berlin over the last ten years and are currently ubiquitous. What does an integrated analysis of data from these novel sources reveal for urban bicycling research, planning, and network design? We explored and visualized the relationships and spatiotemporal variations in (i) bicycling volumes and (ii) crashes, unveiling the (iii) distribution of and correlation between datasets and the city’s bikeway network at an unprecedented threshold. The findings can be useful for special interest groups and to guide future urban bicycling research, planning, and network design

    Urban magnetism through the lens of geo-tagged photography

    No full text
    There is an increasing trend of people leaving digital traces through social media. This reality opens new horizons for urban studies. With this kind of data, researchers and urban planners can detect many aspects of how people live in cities and can also suggest how to transform cities into more efficient and smarter places to live in. In particular, their digital trails can be used to investigate tastes of individuals, and what attracts them to live in a particular city or to spend their vacation there. In this paper we propose an unconventional way to study how people experience the city, using information from geotagged photographs that people take at different locations. We compare the spatial behavior of residents and tourists in 10 most photographed cities all around the world. The study was conducted on both a global and local level. On the global scale we analyze the 10 most photographed cities and measure how attractive each city is for people visiting it from other cities within the same country or from abroad. For the purpose of our analysis we construct the users’ mobility network and measure the strength of the links between each pair of cities as a level of attraction of people living in one city (i.e., origin) to the other city (i.e., destination). On the local level we study the spatial distribution of user activity and identify the photographed hotspots inside each city. The proposed methodology and the results of our study are a low cost mean to characterize touristic activity within a certain location and can help cities strengthening their touristic potential

    Digital Biomarkers for Depression Screening With Wearable Devices: Cross-sectional Study With Machine Learning Modeling

    No full text
    BackgroundDepression is a prevalent mental disorder that is undiagnosed and untreated in half of all cases. Wearable activity trackers collect fine-grained sensor data characterizing the behavior and physiology of users (ie, digital biomarkers), which could be used for timely, unobtrusive, and scalable depression screening. ObjectiveThe aim of this study was to examine the predictive ability of digital biomarkers, based on sensor data from consumer-grade wearables, to detect risk of depression in a working population. MethodsThis was a cross-sectional study of 290 healthy working adults. Participants wore Fitbit Charge 2 devices for 14 consecutive days and completed a health survey, including screening for depressive symptoms using the 9-item Patient Health Questionnaire (PHQ-9), at baseline and 2 weeks later. We extracted a range of known and novel digital biomarkers characterizing physical activity, sleep patterns, and circadian rhythms from wearables using steps, heart rate, energy expenditure, and sleep data. Associations between severity of depressive symptoms and digital biomarkers were examined with Spearman correlation and multiple regression analyses adjusted for potential confounders, including sociodemographic characteristics, alcohol consumption, smoking, self-rated health, subjective sleep characteristics, and loneliness. Supervised machine learning with statistically selected digital biomarkers was used to predict risk of depression (ie, symptom severity and screening status). We used varying cutoff scores from an acceptable PHQ-9 score range to define the depression group and different subsamples for classification, while the set of statistically selected digital biomarkers remained the same. For the performance evaluation, we used k-fold cross-validation and obtained accuracy measures from the holdout folds. ResultsA total of 267 participants were included in the analysis. The mean age of the participants was 33 (SD 8.6, range 21-64) years. Out of 267 participants, there was a mild female bias displayed (n=170, 63.7%). The majority of the participants were Chinese (n=211, 79.0%), single (n=163, 61.0%), and had a university degree (n=238, 89.1%). We found that a greater severity of depressive symptoms was robustly associated with greater variation of nighttime heart rate between 2 AM and 4 AM and between 4 AM and 6 AM; it was also associated with lower regularity of weekday circadian rhythms based on steps and estimated with nonparametric measures of interdaily stability and autocorrelation as well as fewer steps-based daily peaks. Despite several reliable associations, our evidence showed limited ability of digital biomarkers to detect depression in the whole sample of working adults. However, in balanced and contrasted subsamples comprised of depressed and healthy participants with no risk of depression (ie, no or minimal depressive symptoms), the model achieved an accuracy of 80%, a sensitivity of 82%, and a specificity of 78% in detecting subjects at high risk of depression. ConclusionsDigital biomarkers that have been discovered and are based on behavioral and physiological data from consumer wearables could detect increased risk of depression and have the potential to assist in depression screening, yet current evidence shows limited predictive ability. Machine learning models combining these digital biomarkers could discriminate between individuals with a high risk of depression and individuals with no risk

    Utility-driven k-anonymization of public transport user data

    No full text
    In this paper, we propose a k-anonymity approach that prioritizes the generalization of attributes based on their utility. We focus on transport data, which we consider a special case in which many or all attributes are quasi-identifiers (e.g., origin, destination, ride start time), as they allow correlation with easily observable auxiliary data. The novelty in our approach lies in introducing normalization techniques as well as distance and utility metrics that allow the consideration of not only numerical attributes but also categorical attributes by representing them in tree or graph form. The prioritization of the attributes in the generalization process is based on the attributes’ utility and can further be influenced by either automatically or manually assigned attribute weights. We evaluate and compare different options for all components of our mechanism as well as present an extensive performance evaluation of our approach using real-world data. Lastly, we show in which cases suppression of records can counter-intuitively lead to higher data utility.National Research Foundation (NRF)Published versionThis work was supported by the Singapore National Research Foundation through the Campus for Research Excellence and Technological Enterprise (CREATE) Programme
    corecore