5,293 research outputs found
Scalable Privacy-Compliant Virality Prediction on Twitter
The digital town hall of Twitter becomes a preferred medium of communication
for individuals and organizations across the globe. Some of them reach
audiences of millions, while others struggle to get noticed. Given the impact
of social media, the question remains more relevant than ever: how to model the
dynamics of attention in Twitter. Researchers around the world turn to machine
learning to predict the most influential tweets and authors, navigating the
volume, velocity, and variety of social big data, with many compromises. In
this paper, we revisit content popularity prediction on Twitter. We argue that
strict alignment of data acquisition, storage and analysis algorithms is
necessary to avoid the common trade-offs between scalability, accuracy and
privacy compliance. We propose a new framework for the rapid acquisition of
large-scale datasets, high accuracy supervisory signal and multilanguage
sentiment prediction while respecting every privacy request applicable. We then
apply a novel gradient boosting framework to achieve state-of-the-art results
in virality ranking, already before including tweet's visual or propagation
features. Our Gradient Boosted Regression Tree is the first to offer
explainable, strong ranking performance on benchmark datasets. Since the
analysis focused on features available early, the model is immediately
applicable to incoming tweets in 18 languages.Comment: AffCon@AAAI-19 Best Paper Award; Presented at AAAI-19 W1: Affective
Content Analysi
Small worlds and board interlocking in Brazil: a longitudinal study of corporate networks, 1997-2007
Social Network Analysis (SNA) is an emerging research field in finance, above all in Brazil. This work is pioneering in that it is supported by reference to different areas of knowledge: social network analysis and corporate governance, for dealing with a similarly emerging topic in finance; interlocking boards, the purpose being to check the validity of the small-world model in the Brazilian capital market, and the existence of associations between the positioning of the firm in the network of corporate relationships and its worth. To do so official data relating to more than 400 companies listed in Brazil between 1997 and 2007 were used. The main results obtained suggest that the configuration of the networks of relationships between board members and companies reflects the small-world model. Furthermore, there seems to be a significant relationship between the firm’s centrality and its worth, described according to an “inverted U” curve, which suggests the existence of optimum values of social prominence in the corporate network.Board Interlocking; Social Network Analysis in Finance; Company Boards
Knowledge Networks of the Information Technology Management Domain: A Social Network Analysis Approach
Using the social network analysis technique, we decomposed the knowledge networks of the information technology management (ITM) domain. We included a total of 893 papers published during the 1995-2014 period in the network analysis. From this domain, the network and ego level properties—such as, degree centralities, density, components, structural holes, and degree distribution—suggest that, unlike the other information systems communities, the ITM is a community with a unique character and distinct collaboration patterns. The results show that the ITM knowledge networks are fragmented and exhibit a power law distribution in which incoming nodes and links prefer to attach to the nodes that are already well connected. We discuss several implications that arise from the network configuration that could aid the future development of the ITM domain
Finding Influential Users in Social Media Using Association Rule Learning
Influential users play an important role in online social networks since
users tend to have an impact on one other. Therefore, the proposed work
analyzes users and their behavior in order to identify influential users and
predict user participation. Normally, the success of a social media site is
dependent on the activity level of the participating users. For both online
social networking sites and individual users, it is of interest to find out if
a topic will be interesting or not. In this article, we propose association
learning to detect relationships between users. In order to verify the
findings, several experiments were executed based on social network analysis,
in which the most influential users identified from association rule learning
were compared to the results from Degree Centrality and Page Rank Centrality.
The results clearly indicate that it is possible to identify the most
influential users using association rule learning. In addition, the results
also indicate a lower execution time compared to state-of-the-art methods
Current trends and future directions in knowledge management in construction research using social network analysis
The growing interest in Knowledge Management (KM) has led to increased attention to Social Network Analysis (SNA) as a tool to map the relationships in networks. SNA can be used to evaluate knowledge flows between project teams, contributing to collaborative working and improved performance. Similarly, it has the potential to be used for construction projects and organisations. This paper aims at identifying current trends and future research directions related to using SNA for KM in construction. A systematic review and thematic analysis were used to critically review the existing studies and identify potential research areas in construction specifically related to research approaches and explore the possibilities for extension of SNA in KM. The findings revealed that there are knowledge gaps in research approaches with case study-based research involving external stakeholders, collaborations, development of communication protocols, which are priority areas identified for future research. SNA in KM related to construction could be extended to develop models that capture both formal and informal relationships as well as the KM process in pre-construction, construction, and post-construction stages to improve the performance of projects. Similarly, SNA can be integrated with methodological concepts, such as Analytic Hierarchy Process (AHP), knowledge broker, and so forth, to improve KM processes in construction. This study identifies potential research areas that provide the basis for stakeholders and academia to resolve current issues in the use of SNA for KM in construction
Detecting highly overlapping community structure by greedy clique expansion
In complex networks it is common for each node to belong to several
communities, implying a highly overlapping community structure. Recent advances
in benchmarking indicate that existing community assignment algorithms that are
capable of detecting overlapping communities perform well only when the extent
of community overlap is kept to modest levels. To overcome this limitation, we
introduce a new community assignment algorithm called Greedy Clique Expansion
(GCE). The algorithm identifies distinct cliques as seeds and expands these
seeds by greedily optimizing a local fitness function. We perform extensive
benchmarks on synthetic data to demonstrate that GCE's good performance is
robust across diverse graph topologies. Significantly, GCE is the only
algorithm to perform well on these synthetic graphs, in which every node
belongs to multiple communities. Furthermore, when put to the task of
identifying functional modules in protein interaction data, and college dorm
assignments in Facebook friendship data, we find that GCE performs
competitively.Comment: 10 pages, 7 Figures. Implementation source and binaries available at
http://sites.google.com/site/greedycliqueexpansion
The active microbial community more accurately reflects the anaerobic digestion process: 16S rRNA (gene) sequencing as a predictive tool
Background:
Amplicon sequencing methods targeting the 16S rRNA gene have been used extensively to investigate microbial community composition and dynamics in anaerobic digestion. These methods successfully characterize amplicons but do not distinguish micro-organisms that are actually responsible for the process. In this research, the archaeal and bacterial community of 48 full-scale anaerobic digestion plants were evaluated on DNA (total community) and RNA (active community) level via 16S rRNA (gene) amplicon sequencing.
Results:
A significantly higher diversity on DNA compared with the RNA level was observed for archaea, but not for bacteria. Beta diversity analysis showed a significant difference in community composition between the DNA and RNA of both bacteria and archaea. This related with 25.5 and 42.3% of total OTUs for bacteria and archaea, respectively, that showed a significant difference in their DNA and RNA profiles. Similar operational parameters affected the bacterial and archaeal community, yet the differentiating effect between DNA and RNA was much stronger for archaea. Co-occurrence networks and functional prediction profiling confirmed the clear differentiation between DNA and RNA profiles.
Conclusions:
In conclusion, a clear difference in active (RNA) and total (DNA) community profiles was observed, implying the need for a combined approach to estimate community stability in anaerobic digestion
- …