21,234 research outputs found
DEMON: a Local-First Discovery Method for Overlapping Communities
Community discovery in complex networks is an interesting problem with a
number of applications, especially in the knowledge extraction task in social
and information networks. However, many large networks often lack a particular
community organization at a global level. In these cases, traditional graph
partitioning algorithms fail to let the latent knowledge embedded in modular
structure emerge, because they impose a top-down global view of a network. We
propose here a simple local-first approach to community discovery, able to
unveil the modular organization of real complex networks. This is achieved by
democratically letting each node vote for the communities it sees surrounding
it in its limited view of the global system, i.e. its ego neighborhood, using a
label propagation algorithm; finally, the local communities are merged into a
global collection. We tested this intuition against the state-of-the-art
overlapping and non-overlapping community discovery methods, and found that our
new method clearly outperforms the others in the quality of the obtained
communities, evaluated by using the extracted communities to predict the
metadata about the nodes of several real world networks. We also show how our
method is deterministic, fully incremental, and has a limited time complexity,
so that it can be used on web-scale real networks.Comment: 9 pages; Proceedings of the 18th ACM SIGKDD International Conference
on Knowledge Discovery and Data Mining, Beijing, China, August 12-16, 201
Community structure of complex software systems: Analysis and applications
Due to notable discoveries in the fast evolving field of complex networks,
recent research in software engineering has also focused on representing
software systems with networks. Previous work has observed that these networks
follow scale-free degree distributions and reveal small-world phenomena, while
we here explore another property commonly found in different complex networks,
i.e. community structure. We adopt class dependency networks, where nodes
represent software classes and edges represent dependencies among them, and
show that these networks reveal a significant community structure,
characterized by similar properties as observed in other complex networks.
However, although intuitive and anticipated by different phenomena, identified
communities do not exactly correspond to software packages. We empirically
confirm our observations on several networks constructed from Java and various
third party libraries, and propose different applications of community
detection to software engineering
Communities, Knowledge Creation, and Information Diffusion
In this paper, we examine how patterns of scientific collaboration contribute
to knowledge creation. Recent studies have shown that scientists can benefit
from their position within collaborative networks by being able to receive more
information of better quality in a timely fashion, and by presiding over
communication between collaborators. Here we focus on the tendency of
scientists to cluster into tightly-knit communities, and discuss the
implications of this tendency for scientific performance. We begin by reviewing
a new method for finding communities, and we then assess its benefits in terms
of computation time and accuracy. While communities often serve as a taxonomic
scheme to map knowledge domains, they also affect how successfully scientists
engage in the creation of new knowledge. By drawing on the longstanding debate
on the relative benefits of social cohesion and brokerage, we discuss the
conditions that facilitate collaborations among scientists within or across
communities. We show that successful scientific production occurs within
communities when scientists have cohesive collaborations with others from the
same knowledge domain, and across communities when scientists intermediate
among otherwise disconnected collaborators from different knowledge domains. We
also discuss the implications of communities for information diffusion, and
show how traditional epidemiological approaches need to be refined to take
knowledge heterogeneity into account and preserve the system's ability to
promote creative processes of novel recombinations of idea
- …