7 research outputs found

    Latent Space Model for Multi-Modal Social Data

    Full text link
    With the emergence of social networking services, researchers enjoy the increasing availability of large-scale heterogenous datasets capturing online user interactions and behaviors. Traditional analysis of techno-social systems data has focused mainly on describing either the dynamics of social interactions, or the attributes and behaviors of the users. However, overwhelming empirical evidence suggests that the two dimensions affect one another, and therefore they should be jointly modeled and analyzed in a multi-modal framework. The benefits of such an approach include the ability to build better predictive models, leveraging social network information as well as user behavioral signals. To this purpose, here we propose the Constrained Latent Space Model (CLSM), a generalized framework that combines Mixed Membership Stochastic Blockmodels (MMSB) and Latent Dirichlet Allocation (LDA) incorporating a constraint that forces the latent space to concurrently describe the multiple data modalities. We derive an efficient inference algorithm based on Variational Expectation Maximization that has a computational cost linear in the size of the network, thus making it feasible to analyze massive social datasets. We validate the proposed framework on two problems: prediction of social interactions from user attributes and behaviors, and behavior prediction exploiting network information. We perform experiments with a variety of multi-modal social systems, spanning location-based social networks (Gowalla), social media services (Instagram, Orkut), e-commerce and review sites (Amazon, Ciao), and finally citation networks (Cora). The results indicate significant improvement in prediction accuracy over state of the art methods, and demonstrate the flexibility of the proposed approach for addressing a variety of different learning problems commonly occurring with multi-modal social data.Comment: 12 pages, 7 figures, 2 table

    Adapting the Standard SIR Disease Model in Order to Track and Predict the Spreading of the EBOLA Virus Using Twitter Data

    Get PDF
    A method has been developed to track infectious diseases by using data mining of active Twitter accounts and its efficacy was demonstrated during the West African Ebola outbreak of 2014. Using a meme based n-gram semantic usage model to search the Twitter database for indications of illness, flight and death from the spread of Ebola in Africa, principally from Guinea, Sierra Leone and Liberia. Memes of interest relate disease to location and severity and are coupled to the density of Tweets and re-Tweets. The meme spreads through the community of social users in a fashion similar to nonlinear wave propagation- like a shock wave, visualized as a spike in Tweet activity. The spreading was modeled as a system isomorphic to a modified SIR (Susceptible, Infected, Removed disease model) system of three coupled nonlinear differential equations using Twitter variables. The nonlinear terms in this model lead to feedback mechanisms that result in unusual behavior that does not always reduce the spread of the disease. The resulting geographic Tweet densities are coupled to geographic maps of the region. These maps have specific threat levels that are ported to a mobile application (app) and can be used by travelers to assess the relative safety of the region they will be in

    An overview on user profiling in online social networks

    Get PDF
    Advances in Online Social Networks is creating huge data day in and out providing lot of opportunities to its users to express their interest and opinion. Due to the popularity and exposure of social networks, many intruders are using this platform for illegal purposes. Identifying such users is challenging and requires digging huge knowledge out of the data being flown in the social media. This work gives an insight to profile users in online social networks. User Profiles are established based on the behavioral patterns, correlations and activities of the user analyzed from the aggregated data using techniques like clustering, behavioral analysis, content analysis and face detection. Depending on application and purpose, the mechanism used in profiling users varies. Further study on other mechanisms used in profiling users is under the scope of future endeavors

    Analysis of a heterogeneous social network of humans and cultural objects

    No full text
    Modern online social platforms allow their members to be involved in a broad range of activities including getting friends, joining groups, posting and commenting resources. In this article we investigate whether a correlation emerges across the different activities a user can take part in. For our analysis we focused on aNobii, a social platform with a world-wide user base of book readers, who post their readings, give ratings, review books and discuss them with friends and fellow readers. aNobii presents a heterogeneous structure: i) part social network, with user-to-user interactions, ii) part interest network, with the management of book collections, and iii) part folksonomy, with books that are tagged by the users. We analysed a complete snapshot of aNobii and we focused on three specific activities a user can perform, namely tagging behaviour, tendency to join groups and aptitude to compile a wishlist of the books one is planning to read. For each user we create a tag-based, a group-based and a wishlist-based profile. Experimental analysis , which was carried out with Information-Theory tools like entropy and mutual information, suggests that tag-based and group-based profiles are in general more informative than wishlist-based ones. Furthermore, we discover that the degree of correlation between the three profiles associated with the same user tend to be small. Hence, user profiling cannot be reduced to considering just any one type of user activity (albeit important) but it is crucial to incorporate multiple dimensions to effectively describe users' preferences and behaviour
    corecore