114,512 research outputs found
Online Social Networks: Measurements, Analysis and Solutions for Mining Challenges
In the last decade, online social networks showed enormous growth. With the rise
of these networks and the consequent availability of wealth social network data, Social
Network Analysis (SNA) led researchers to get the opportunity to access, analyse and
mine the social behaviour of millions of people, explore the way they communicate and
exchange information.
Despite the growing interest in analysing social networks, there are some challenges
and implications accompanying the analysis and mining of these networks. For example,
dealing with large-scale and evolving networks is not yet an easy task and still requires
a new mining solution. In addition, finding communities within these networks is a
challenging task and could open opportunities to see how people behave in groups on a
large scale. Also, the challenge of validating and optimizing communities without knowing
in advance the structure of the network due to the lack of ground truth is yet another
challenging barrier for validating the meaningfulness of the resulting communities.
In this thesis, we started by providing an overview of the necessary background and key
concepts required in the area of social networks analysis. Our main focus is to provide
solutions to tackle the key challenges in this area. For doing so, first, we introduce a predictive
technique to help in the prediction of the execution time of the analysis tasks for
evolving networks through employing predictive modeling techniques to the problem of
evolving and large-scale networks. Second, we study the performance of existing community
detection approaches to derive high quality community structure using a real email
network through analysing the exchange of emails and exploring community dynamics.
The aim is to study the community behavioral patterns and evaluate their quality within
an actual network. Finally, we propose an ensemble technique for deriving communities
using a rich internal enterprise real network in IBM that reflects real collaborations
and communications between employees. The technique aims to improve the community
detection process through the fusion of different algorithms
Abusive Language Detection in Online Conversations by Combining Content-and Graph-based Features
In recent years, online social networks have allowed worldwide users to meet
and discuss. As guarantors of these communities, the administrators of these
platforms must prevent users from adopting inappropriate behaviors. This
verification task, mainly done by humans, is more and more difficult due to the
ever growing amount of messages to check. Methods have been proposed to
automatize this moderation process, mainly by providing approaches based on the
textual content of the exchanged messages. Recent work has also shown that
characteristics derived from the structure of conversations, in the form of
conversational graphs, can help detecting these abusive messages. In this
paper, we propose to take advantage of both sources of information by proposing
fusion methods integrating content-and graph-based features. Our experiments on
raw chat logs show that the content of the messages, but also of their dynamics
within a conversation contain partially complementary information, allowing
performance improvements on an abusive message classification task with a final
F-measure of 93.26%
BL-MNE: Emerging Heterogeneous Social Network Embedding through Broad Learning with Aligned Autoencoder
Network embedding aims at projecting the network data into a low-dimensional
feature space, where the nodes are represented as a unique feature vector and
network structure can be effectively preserved. In recent years, more and more
online application service sites can be represented as massive and complex
networks, which are extremely challenging for traditional machine learning
algorithms to deal with. Effective embedding of the complex network data into
low-dimension feature representation can both save data storage space and
enable traditional machine learning algorithms applicable to handle the network
data. Network embedding performance will degrade greatly if the networks are of
a sparse structure, like the emerging networks with few connections. In this
paper, we propose to learn the embedding representation for a target emerging
network based on the broad learning setting, where the emerging network is
aligned with other external mature networks at the same time. To solve the
problem, a new embedding framework, namely "Deep alIgned autoencoder based
eMbEdding" (DIME), is introduced in this paper. DIME handles the diverse link
and attribute in a unified analytic based on broad learning, and introduces the
multiple aligned attributed heterogeneous social network concept to model the
network structure. A set of meta paths are introduced in the paper, which
define various kinds of connections among users via the heterogeneous link and
attribute information. The closeness among users in the networks are defined as
the meta proximity scores, which will be fed into DIME to learn the embedding
vectors of users in the emerging network. Extensive experiments have been done
on real-world aligned social networks, which have demonstrated the
effectiveness of DIME in learning the emerging network embedding vectors.Comment: 10 pages, 9 figures, 4 tables. Full paper is accepted by ICDM 2017,
In: Proceedings of the 2017 IEEE International Conference on Data Mining
The use of multilayer network analysis in animal behaviour
Network analysis has driven key developments in research on animal behaviour
by providing quantitative methods to study the social structures of animal
groups and populations. A recent formalism, known as \emph{multilayer network
analysis}, has advanced the study of multifaceted networked systems in many
disciplines. It offers novel ways to study and quantify animal behaviour as
connected 'layers' of interactions. In this article, we review common questions
in animal behaviour that can be studied using a multilayer approach, and we
link these questions to specific analyses. We outline the types of behavioural
data and questions that may be suitable to study using multilayer network
analysis. We detail several multilayer methods, which can provide new insights
into questions about animal sociality at individual, group, population, and
evolutionary levels of organisation. We give examples for how to implement
multilayer methods to demonstrate how taking a multilayer approach can alter
inferences about social structure and the positions of individuals within such
a structure. Finally, we discuss caveats to undertaking multilayer network
analysis in the study of animal social networks, and we call attention to
methodological challenges for the application of these approaches. Our aim is
to instigate the study of new questions about animal sociality using the new
toolbox of multilayer network analysis.Comment: Thoroughly revised; title changed slightl
- …