5,293 research outputs found

    Scalable Privacy-Compliant Virality Prediction on Twitter

    The digital town hall of Twitter has become a preferred medium of communication for individuals and organizations across the globe. Some of them reach audiences of millions, while others struggle to get noticed. Given the impact of social media, the question of how to model the dynamics of attention on Twitter remains more relevant than ever. Researchers around the world turn to machine learning to predict the most influential tweets and authors, navigating the volume, velocity, and variety of social big data, with many compromises. In this paper, we revisit content popularity prediction on Twitter. We argue that strict alignment of data acquisition, storage and analysis algorithms is necessary to avoid the common trade-offs between scalability, accuracy and privacy compliance. We propose a new framework for the rapid acquisition of large-scale datasets, a high-accuracy supervisory signal and multilanguage sentiment prediction, while respecting every applicable privacy request. We then apply a novel gradient boosting framework to achieve state-of-the-art results in virality ranking, even before including a tweet's visual or propagation features. Our Gradient Boosted Regression Tree is the first to offer explainable, strong ranking performance on benchmark datasets. Since the analysis focused on features available early, the model is immediately applicable to incoming tweets in 18 languages.
    Comment: AffCon@AAAI-19 Best Paper Award; Presented at AAAI-19 W1: Affective Content Analysi

    Small worlds and board interlocking in Brazil: a longitudinal study of corporate networks, 1997-2007

    Social Network Analysis (SNA) is an emerging research field in finance, above all in Brazil. This work is pioneering in that it draws on two areas of knowledge, social network analysis and corporate governance, to address a similarly emerging topic in finance: interlocking boards. Its purpose is to check the validity of the small-world model in the Brazilian capital market and to test for associations between a firm’s position in the network of corporate relationships and its worth. To do so, official data on more than 400 companies listed in Brazil between 1997 and 2007 were used. The main results suggest that the configuration of the networks of relationships between board members and companies reflects the small-world model. Furthermore, there seems to be a significant relationship between the firm’s centrality and its worth, described by an “inverted U” curve, which suggests the existence of optimum values of social prominence in the corporate network.
    Keywords: Board Interlocking; Social Network Analysis in Finance; Company Boards
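
The small-world check described in this abstract usually rests on two statistics: a high average clustering coefficient combined with a short average path length. The sketch below is not the authors' code. It assumes a hypothetical `boards` mapping from company to director set, links two companies when they share a director, and computes both statistics with the standard library only.

```python
from collections import deque
from itertools import combinations

# Hypothetical toy data: company -> set of board members.
boards = {
    "A": {"d1", "d2"},
    "B": {"d2", "d3"},
    "C": {"d3", "d1"},
    "D": {"d3", "d4"},
}

def interlock_graph(boards):
    """Link two companies when their boards share at least one director."""
    graph = {company: set() for company in boards}
    for a, b in combinations(boards, 2):
        if boards[a] & boards[b]:
            graph[a].add(b)
            graph[b].add(a)
    return graph

def average_clustering(graph):
    """Mean local clustering coefficient; nodes with degree < 2 count as 0."""
    total = 0.0
    for nbrs in graph.values():
        k = len(nbrs)
        if k < 2:
            continue
        links = sum(1 for u, v in combinations(nbrs, 2) if v in graph[u])
        total += 2 * links / (k * (k - 1))
    return total / len(graph)

def average_path_length(graph):
    """Mean shortest-path length over reachable ordered pairs, via BFS."""
    total, pairs = 0, 0
    for source in graph:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in graph[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        total += sum(dist.values())
        pairs += len(dist) - 1
    return total / pairs if pairs else 0.0

graph = interlock_graph(boards)
clustering = average_clustering(graph)
path_length = average_path_length(graph)
```

In practice both values would be compared against a random graph with the same number of nodes and edges; a network is typically called small-world when its clustering is much higher while its path length stays comparable.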

    Knowledge Networks of the Information Technology Management Domain: A Social Network Analysis Approach

    Using social network analysis, we decomposed the knowledge networks of the information technology management (ITM) domain. We included a total of 893 papers published during the 1995-2014 period in the network analysis. The network-level and ego-level properties of this domain (such as degree centralities, density, components, structural holes, and degree distribution) suggest that, unlike other information systems communities, the ITM is a community with a unique character and distinct collaboration patterns. The results show that the ITM knowledge networks are fragmented and exhibit a power-law distribution in which incoming nodes and links prefer to attach to nodes that are already well connected. We discuss several implications of the network configuration that could aid the future development of the ITM domain.
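
Several of the network-level properties named in this abstract (density, components, degree distribution) are simple to compute. The following is a minimal sketch on a hypothetical co-authorship edge list, not the study's data or code: a fragmented network shows up as more than one component, and a skewed degree distribution as a few high-degree nodes alongside many low-degree ones.

```python
from collections import Counter

# Hypothetical co-authorship edges between authors p1..p6.
edges = [("p1", "p2"), ("p1", "p3"), ("p1", "p4"), ("p5", "p6")]

def build_graph(edges):
    """Undirected adjacency sets from an edge list."""
    graph = {}
    for u, v in edges:
        graph.setdefault(u, set()).add(v)
        graph.setdefault(v, set()).add(u)
    return graph

def density(graph):
    """Share of possible undirected edges that are present."""
    n = len(graph)
    m = sum(len(nbrs) for nbrs in graph.values()) // 2
    return 2 * m / (n * (n - 1)) if n > 1 else 0.0

def components(graph):
    """Connected components, found by depth-first search."""
    seen, result = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            node = stack.pop()
            if node in comp:
                continue
            comp.add(node)
            stack.extend(graph[node] - comp)
        seen |= comp
        result.append(comp)
    return result

def degree_distribution(graph):
    """Map each degree value to the number of nodes that have it."""
    return Counter(len(nbrs) for nbrs in graph.values())

graph = build_graph(edges)
net_density = density(graph)
net_components = components(graph)
net_degrees = degree_distribution(graph)
```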

    Finding Influential Users in Social Media Using Association Rule Learning

    Influential users play an important role in online social networks, since users tend to have an impact on one another. Therefore, the proposed work analyzes users and their behavior in order to identify influential users and predict user participation. Normally, the success of a social media site depends on the activity level of its participating users. For both online social networking sites and individual users, it is of interest to find out whether a topic will be interesting or not. In this article, we propose association rule learning to detect relationships between users. To verify the findings, several experiments based on social network analysis were executed, in which the most influential users identified by association rule learning were compared to the results from degree centrality and PageRank centrality. The results clearly indicate that it is possible to identify the most influential users using association rule learning. In addition, the results indicate a lower execution time compared to state-of-the-art methods.
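
The two baselines this abstract compares against, degree centrality and PageRank, can be sketched in a few lines. This example does not implement the paper's association rule learning. The `links` mapping (who replies to whom) and the user names are hypothetical, and PageRank is computed by plain power iteration with the commonly used damping factor of 0.85.

```python
# Hypothetical interaction graph: user -> users they reply to.
links = {
    "ann": ["bob"],
    "cat": ["bob"],
    "dan": ["bob", "ann"],
    "bob": ["ann"],
}

def all_nodes(links):
    """Every user that appears as a source or a target."""
    return sorted(set(links) | {v for targets in links.values() for v in targets})

def in_degree_centrality(links):
    """Fraction of the other users that point at each user."""
    nodes = all_nodes(links)
    counts = {node: 0 for node in nodes}
    for targets in links.values():
        for v in set(targets):
            counts[v] += 1
    scale = max(len(nodes) - 1, 1)
    return {node: count / scale for node, count in counts.items()}

def pagerank(links, damping=0.85, iterations=100):
    """Power-iteration PageRank; dangling users spread their rank evenly."""
    nodes = all_nodes(links)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        dangling = sum(rank[u] for u in nodes if not links.get(u))
        base = (1 - damping) / n + damping * dangling / n
        new_rank = {node: base for node in nodes}
        for u, targets in links.items():
            for v in targets:
                new_rank[v] += damping * rank[u] / len(targets)
        rank = new_rank
    return rank

degree_scores = in_degree_centrality(links)
pagerank_scores = pagerank(links)
top_by_degree = max(degree_scores, key=degree_scores.get)
top_by_pagerank = max(pagerank_scores, key=pagerank_scores.get)
```

On this toy graph both measures pick the same user, which is the kind of agreement a comparison of influence rankings would look for.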

    Current trends and future directions in knowledge management in construction research using social network analysis

    The growing interest in Knowledge Management (KM) has led to increased attention to Social Network Analysis (SNA) as a tool to map the relationships in networks. SNA can be used to evaluate knowledge flows between project teams, contributing to collaborative working and improved performance. Similarly, it has the potential to be used for construction projects and organisations. This paper aims to identify current trends and future research directions related to using SNA for KM in construction. A systematic review and thematic analysis were used to critically review the existing studies, identify potential research areas in construction specifically related to research approaches, and explore the possibilities for extending SNA in KM. The findings revealed knowledge gaps in research approaches with case study-based research involving external stakeholders, collaborations, and the development of communication protocols, which are priority areas identified for future research. SNA in KM related to construction could be extended to develop models that capture both formal and informal relationships, as well as the KM process in the pre-construction, construction, and post-construction stages, to improve the performance of projects. Similarly, SNA can be integrated with methodological concepts, such as Analytic Hierarchy Process (AHP), knowledge broker, and so forth, to improve KM processes in construction. This study identifies potential research areas that provide the basis for stakeholders and academia to resolve current issues in the use of SNA for KM in construction.

    Detecting highly overlapping community structure by greedy clique expansion

    In complex networks it is common for each node to belong to several communities, implying a highly overlapping community structure. Recent advances in benchmarking indicate that existing community assignment algorithms that are capable of detecting overlapping communities perform well only when the extent of community overlap is kept to modest levels. To overcome this limitation, we introduce a new community assignment algorithm called Greedy Clique Expansion (GCE). The algorithm identifies distinct cliques as seeds and expands these seeds by greedily optimizing a local fitness function. We perform extensive benchmarks on synthetic data to demonstrate that GCE's good performance is robust across diverse graph topologies. Significantly, GCE is the only algorithm to perform well on these synthetic graphs, in which every node belongs to multiple communities. Furthermore, when put to the task of identifying functional modules in protein interaction data, and college dorm assignments in Facebook friendship data, we find that GCE performs competitively.
    Comment: 10 pages, 7 Figures. Implementation source and binaries available at http://sites.google.com/site/greedycliqueexpansion
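
The seed-and-expand idea in this abstract can be illustrated with a simplified sketch. It is not the authors' implementation: the fitness function `k_in / (k_in + k_out) ** alpha`, the `min_clique_size` threshold and the toy graph are all assumptions made for illustration, and any further steps of the published algorithm are omitted.

```python
def build_graph(edges):
    """Undirected adjacency sets from an edge list."""
    graph = {}
    for u, v in edges:
        graph.setdefault(u, set()).add(v)
        graph.setdefault(v, set()).add(u)
    return graph

def maximal_cliques(graph):
    """Basic Bron-Kerbosch enumeration of maximal cliques (no pivoting)."""
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(set(r))
            return
        for v in list(p):
            expand(r | {v}, p & graph[v], x & graph[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(graph), set())
    return cliques

def fitness(graph, community, alpha=1.0):
    """Assumed local fitness: internal degree over total degree ** alpha."""
    k_in = sum(len(graph[v] & community) for v in community)
    k_out = sum(len(graph[v] - community) for v in community)
    total = k_in + k_out
    return k_in / total ** alpha if total else 0.0

def expand_seed(graph, seed, alpha=1.0):
    """Add the neighbour that most improves fitness until none does."""
    community = set(seed)
    while True:
        frontier = set().union(*(graph[v] for v in community)) - community
        best, best_fit = None, fitness(graph, community, alpha)
        for v in sorted(frontier):
            candidate_fit = fitness(graph, community | {v}, alpha)
            if candidate_fit > best_fit:
                best, best_fit = v, candidate_fit
        if best is None:
            return community
        community.add(best)

def greedy_clique_expansion(graph, min_clique_size=4, alpha=1.0):
    """Expand every sufficiently large maximal clique; drop exact repeats."""
    communities = []
    for seed in maximal_cliques(graph):
        if len(seed) < min_clique_size:
            continue
        community = expand_seed(graph, seed, alpha)
        if community not in communities:
            communities.append(community)
    return communities

# Toy graph: two 4-cliques sharing node 3, plus node 7 attached to 0 and 1.
edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3),
         (3, 4), (3, 5), (3, 6), (4, 5), (4, 6), (5, 6),
         (7, 0), (7, 1)]
communities = greedy_clique_expansion(build_graph(edges))
```

On this toy graph node 3 ends up in both communities, which is the overlapping membership that a partition-based method cannot express.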

    The active microbial community more accurately reflects the anaerobic digestion process: 16S rRNA (gene) sequencing as a predictive tool

    Background: Amplicon sequencing methods targeting the 16S rRNA gene have been used extensively to investigate microbial community composition and dynamics in anaerobic digestion. These methods successfully characterize amplicons but do not distinguish the micro-organisms that are actually responsible for the process. In this research, the archaeal and bacterial communities of 48 full-scale anaerobic digestion plants were evaluated at the DNA (total community) and RNA (active community) levels via 16S rRNA (gene) amplicon sequencing. Results: A significantly higher diversity at the DNA level than at the RNA level was observed for archaea, but not for bacteria. Beta diversity analysis showed a significant difference in community composition between the DNA and RNA of both bacteria and archaea. This corresponded to 25.5% and 42.3% of total OTUs for bacteria and archaea, respectively, showing a significant difference between their DNA and RNA profiles. Similar operational parameters affected the bacterial and archaeal communities, yet the differentiating effect between DNA and RNA was much stronger for archaea. Co-occurrence networks and functional prediction profiling confirmed the clear differentiation between DNA and RNA profiles. Conclusions: A clear difference between active (RNA) and total (DNA) community profiles was observed, implying the need for a combined approach to estimate community stability in anaerobic digestion.
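
The abstract does not say which diversity measures were used. As an illustration only, the sketch below assumes two common choices, the Shannon index for within-sample diversity and Bray-Curtis dissimilarity for beta diversity, and applies them to hypothetical OTU count tables for a DNA and an RNA profile of the same sample.

```python
import math

# Hypothetical OTU counts for one sample at the DNA and RNA level.
dna_counts = {"otu1": 50, "otu2": 30, "otu3": 20}
rna_counts = {"otu1": 80, "otu2": 20, "otu3": 0}

def shannon_index(counts):
    """Shannon diversity H = -sum(p * ln p) over OTUs with non-zero counts."""
    total = sum(counts.values())
    return -sum(
        (c / total) * math.log(c / total) for c in counts.values() if c > 0
    )

def bray_curtis(a, b):
    """Bray-Curtis dissimilarity: 0 for identical profiles, 1 for disjoint."""
    otus = set(a) | set(b)
    shared = sum(min(a.get(o, 0), b.get(o, 0)) for o in otus)
    total = sum(a.values()) + sum(b.values())
    return 1 - 2 * shared / total if total else 0.0

dna_diversity = shannon_index(dna_counts)
rna_diversity = shannon_index(rna_counts)
dna_rna_dissimilarity = bray_curtis(dna_counts, rna_counts)
```

A higher Shannon value for the DNA profile than for the RNA profile, together with a non-zero dissimilarity between the two, is the pattern the abstract reports for archaea.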