32 research outputs found

    Using Robust PCA to estimate regional characteristics of language use from geo-tagged Twitter messages

    Full text link
    Principal component analysis (PCA) and related techniques have been successfully employed in natural language processing. Text mining applications in the age of the online social media (OSM) face new challenges due to properties specific to these use cases (e.g. spelling issues specific to texts posted by users, the presence of spammers and bots, service announcements, etc.). In this paper, we employ a Robust PCA technique to separate typical outliers and highly localized topics from the low-dimensional structure present in language use in online social networks. Our focus is on identifying geospatial features among the messages posted by the users of the Twitter microblogging service. Using a dataset which consists of over 200 million geolocated tweets collected over the course of a year, we investigate whether the information present in word usage frequencies can be used to identify regional features of language use and topics of interest. Using the PCA pursuit method, we are able to identify important low-dimensional features, which constitute smoothly varying functions of the geographic location

    The effect of network structure on phase transitions in queuing networks

    Get PDF
    Recently, De Martino et al have presented a general framework for the study of transportation phenomena on complex networks. One of their most significant achievements was a deeper understanding of the phase transition from the uncongested to the congested phase at a critical traffic load. In this paper, we also study phase transition in transportation networks using a discrete time random walk model. Our aim is to establish a direct connection between the structure of the graph and the value of the critical traffic load. Applying spectral graph theory, we show that the original results of De Martino et al showing that the critical loading depends only on the degree sequence of the graph -- suggesting that different graphs with the same degree sequence have the same critical loading if all other circumstances are fixed -- is valid only if the graph is dense enough. For sparse graphs, higher order corrections, related to the local structure of the network, appear.Comment: 12 pages, 7 figure

    Spatial Fingerprints of Community Structure in Human Interaction Network for an Extensive Set of Large-Scale Regions

    Get PDF
    Human interaction networks inferred from country-wide telephone activity recordings were recently used to redraw political maps by projecting their topological partitions into geographical space. The results showed remarkable spatial cohesiveness of the network communities and a significant overlap between the redrawn and the administrative borders. Here we present a similar analysis based on one of the most popular online social networks represented by the ties between more than 5.8 million of its geo-located users. The worldwide coverage of their measured activity allowed us to analyze the large-scale regional subgraphs of entire continents and an extensive set of examples for single countries. We present results for North and South America, Europe and Asia. In our analysis we used the well- established method of modularity clustering after an aggregation of the individual links into a weighted graph connecting equal- area geographical pixels. Our results show fingerprints of both of the opposing forces of dividing local conflicts and of uniting cross-cultural trends of globalization

    Short-time behavior of continuous-time quantum walks on graphs

    Get PDF
    Dynamical evolution of systems with sparse Hamiltonians can always be recognized as continuous-time quantum walks (CTQWs) on graphs. In this paper, we analyze the short-time asymptotics of CTQWs. In recent studies, it was shown that for the classical diffusion process the short-time asymptotics of the transition probabilities follows power laws whose exponents are given by the usual combinatorial distances of the nodes. Inspired by this result, we perform a similar analysis for CTQWs in both closed and open systems, including time-dependent couplings. For time-reversal symmetric coherent quantum evolutions, the short-time asymptotics of the transition probabilities is completely determined by the topology of the underlying graph analogously to the classical case, but with a doubled power-law exponent. Moreover, this result is robust against the introduction of on-site potential terms. However, we show that time-reversal symmetry-breaking terms and noncoherent effects can significantly alter the short-time asymptotics. The analytical formulas are checked against numerics, and excellent agreement is found. Furthermore, we discuss in detail the relevance of our results for quantum evolutions on particular network topologies

    Communities formed in Switzerland.

    No full text
    <p>These are clusters formed within the European continent’s regional subgraph, focusing on the single country of <i>Switzerland</i>. Each cluster of geo-pixels has a different unique color. The <i>red</i> and the <i>purple</i> correspond to the communities concentrated in <i>France</i> and in <i>Italy</i> respectively. The subnational boundaries are marked with thinner lines, depicting the administrative units of this graph.</p
    corecore