Search CORE

4,769 research outputs found

Graph Summarization

Author: Bonifati Angela
Dumbrava Stefania
Kondylakis Haridimos
Publication venue
Publication date: 01/04/2020
Field of study

The continuous and rapid growth of highly interconnected datasets, which are both voluminous and complex, calls for the development of adequate processing and analytical techniques. One method for condensing and simplifying such datasets is graph summarization. It denotes a series of application-specific algorithms designed to transform graphs into more compact representations while preserving structural patterns, query answers, or specific property distributions. As this problem is common to several areas studying graph topologies, different approaches, such as clustering, compression, sampling, or influence detection, have been proposed, primarily based on statistical and optimization methods. The focus of our chapter is to pinpoint the main graph summarization methods, but especially to focus on the most recent approaches and novel research trends on this topic, not yet covered by previous surveys.Comment: To appear in the Encyclopedia of Big Data Technologie

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

HAL

Hal-Diderot

Synchronization in complex networks

Author: Acebrón
Albert
Albert Díaz-Guilera
Alex Arenas
Almendral
Amritkar
Anderson
Andrade
Arenas
Arenas
Arenas
Arenas
Arenas
Ariaratnam
Atay
Atay
Atay
Bandyopadhyay
Barabási
Barabási
Barahona
Barrat
Barthélemy
Basalto
Bassett
Bassett
Batista
Belykh
Belykh
Belykh
Bi
Bianconi
Binzegger
Blasius
Boccaletti
Boccaletti
Bollobás
Braitenberg
Braunstein
Brede
Buchanan
Buzsáki
Buzsáki
Calderón
Cartwright
Changsong Zhou
Chassin
Chavez
Cheeger
Chen
Chen
Chialvo
Chklovskii
Chung
Chung
Cohen
Costa
Crucitti
Crucitti
da Silva
Danon
Degesys
Denker
di Bernardo
Donetti
Donetti
Dorogovtsev
Dorogovtsev
Dorogovtsev
Dorogovtsev
Dunne
Díaz-Guilera
Earn
Edwards
Eguiluz
Elowitz
Engel
Erdös
Felleman
Fiedler
Filatrella
Fink
Forbes
Fox
Fries
Fukuda
Gade
Gade
García-Ojalvo
Garlaschelli
Garlaschelli
Garlaschelli
Gfeller
Girvan
Gleiser
Goh
Golomb
Grassly
Grenfell
Grinstein
Gross
Guardiola
Guclu
Guimerà
Guimerà
Guo
Gómez-Gardeñes
Gómez-Gardeñes
Gómez-Gardeñes
Gómez-Gardeñes
Hall
Hekmat
Helbing
Helbing
Helbing
Hilgetag
Hilgetag
Hodgkin
Honey
Hong
Hong
Hong
Huang
Hwang
Ichinomiya
Ichinomiya
Ito
Ito
Jalan
Jalan
Jeong
Jost
Jurgen Kurths
Kaiser
Karbowski
Keith
Kim
Kim
Kim
Konig
Kori
Korniss
Korniss
Koseska
Kuramoto
Kuramoto
Lago-Fernández
Lai
Lee
Leibold
Levnajic
Leyva
Li
Li
Li
Li
Li
Lind
Lodato
Lu
Lämmer
Macdonald
Mantegna
Markram
Marro
Martí
Maser
Masoller
Matías
McGraw
McGraw
Miyano
Mohar
Mohar
Mohar
Monasson
Moran
Moreno
Moreno
Moreno
Motter
Motter
Motter
Motter
Muñoz
Nagatani
Newman
Newman
Newman
Newman
Newman
Newman
Newman
Newman
Nicol
Niebur
Nishikawa
Nishikawa
Nishikawa
Oh
Oh
Olfati-Saber
Olfati-Saber
Onnela
Osipov
Ott
Parker
Pecora
Pecora
Pikovsky
Pluchino
Popovych
Radicchi
Rajan
Ranta
Reijneveld
Restrepo
Restrepo
Restrepo
Restrepo
Restrepo
Restrepo
Ripa
Rodgers
Roxin
Sachtjen
Salvador
Scannell
Scirè
Serrano
Sik
Sik
Silberberg
Simeone
Sivrikaya
Sorrentino
Sporns
Sporns
Stam
Stam
Stenseth
Stephan
Steyvers
Strogatz
Strogatz
Strogatz
Strogatz
Tass
Travers
Vandermeer
Vragovic
Wagemakers
Wang
Wang
Watts
Watts
Wiener
Winfree
Winfree
Wu
Wu
Wu
Wu
Yamir Moreno
Yin
Yook
Yook
Zamora-Lopez
Zemanová
Zhao
Zhao
Zhao
Zhou
Zhou
Zhou
Zhou
Zhou
Zhou
Zou
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

Synchronization processes in populations of locally interacting elements are in the focus of intense research in physical, biological, chemical, technological and social systems. The many efforts devoted to understand synchronization phenomena in natural systems take now advantage of the recent theory of complex networks. In this review, we report the advances in the comprehension of synchronization phenomena when oscillating elements are constrained to interact in a complex network topology. We also overview the new emergent features coming out from the interplay between the structure and the function of the underlying pattern of connections. Extensive numerical work as well as analytical approaches to the problem are presented. Finally, we review several applications of synchronization in complex networks to different disciplines: biological systems and neuroscience, engineering and computer science, and economy and social sciences.Comment: Final version published in Physics Reports. More information available at http://synchronets.googlepages.com

arXiv.org e-Print Archive

CiteSeerX

Crossref

Institute of Mathematics AS CR, v. v. i.

A Bayesian alternative to mutual information for the hierarchical clustering of dependent random variables

Author: Bellec Pierre
Marrelec Guillaume
Messé Arnaud
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015
Field of study

The use of mutual information as a similarity measure in agglomerative hierarchical clustering (AHC) raises an important issue: some correction needs to be applied for the dimensionality of variables. In this work, we formulate the decision of merging dependent multivariate normal variables in an AHC procedure as a Bayesian model comparison. We found that the Bayesian formulation naturally shrinks the empirical covariance matrix towards a matrix set a priori (e.g., the identity), provides an automated stopping rule, and corrects for dimensionality using a term that scales up the measure as a function of the dimensionality of the variables. Also, the resulting log Bayes factor is asymptotically proportional to the plug-in estimate of mutual information, with an additive correction for dimensionality in agreement with the Bayesian information criterion. We investigated the behavior of these Bayesian alternatives (in exact and asymptotic forms) to mutual information on simulated and real data. An encouraging result was first derived on simulations: the hierarchical clustering based on the log Bayes factor outperformed off-the-shelf clustering techniques as well as raw and normalized mutual information in terms of classification accuracy. On a toy example, we found that the Bayesian approaches led to results that were similar to those of mutual information clustering techniques, with the advantage of an automated thresholding. On real functional magnetic resonance imaging (fMRI) datasets measuring brain activity, it identified clusters consistent with the established outcome of standard procedures. On this application, normalized mutual information had a highly atypical behavior, in the sense that it systematically favored very large clusters. These initial experiments suggest that the proposed Bayesian alternatives to mutual information are a useful new tool for hierarchical clustering

arXiv.org e-Print Archive

CiteSeerX

Crossref

HAL-Inserm

Directory of Open Access Journals

PubMed Central

FigShare

Twin Learning for Similarity and Clustering: A Unified Kernel Approach

Author: Cheng Qiang
Kang Zhao
Peng Chong
Publication venue
Publication date: 13/02/2017
Field of study

Many similarity-based clustering methods work in two separate steps including similarity matrix computation and subsequent spectral clustering. However, similarity measurement is challenging because it is usually impacted by many factors, e.g., the choice of similarity metric, neighborhood size, scale of data, noise and outliers. Thus the learned similarity matrix is often not suitable, let alone optimal, for the subsequent clustering. In addition, nonlinear similarity often exists in many real world data which, however, has not been effectively considered by most existing methods. To tackle these two challenges, we propose a model to simultaneously learn cluster indicator matrix and similarity information in kernel spaces in a principled way. We show theoretical relationships to kernel k-means, k-means, and spectral clustering methods. Then, to address the practical issue of how to select the most suitable kernel for a particular clustering task, we further extend our model with a multiple kernel learning ability. With this joint model, we can automatically accomplish three subtasks of finding the best cluster indicator matrix, the most accurate similarity relations and the optimal combination of multiple kernels. By leveraging the interactions between these three subtasks in a joint framework, each subtask can be iteratively boosted by using the results of the others towards an overall optimal solution. Extensive experiments are performed to demonstrate the effectiveness of our method.Comment: Published in AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Evaluating Overfit and Underfit in Models of Network Community Structure

Author: Clauset Aaron
Ghasemian Amir
Hosseinmardi Homa
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

A common data mining task on networks is community detection, which seeks an unsupervised decomposition of a network into structural groups based on statistical regularities in the network's connectivity. Although many methods exist, the No Free Lunch theorem for community detection implies that each makes some kind of tradeoff, and no algorithm can be optimal on all inputs. Thus, different algorithms will over or underfit on different inputs, finding more, fewer, or just different communities than is optimal, and evaluation methods that use a metadata partition as a ground truth will produce misleading conclusions about general accuracy. Here, we present a broad evaluation of over and underfitting in community detection, comparing the behavior of 16 state-of-the-art community detection algorithms on a novel and structurally diverse corpus of 406 real-world networks. We find that (i) algorithms vary widely both in the number of communities they find and in their corresponding composition, given the same input, (ii) algorithms can be clustered into distinct high-level groups based on similarities of their outputs on real-world networks, and (iii) these differences induce wide variation in accuracy on link prediction and link description tasks. We introduce a new diagnostic for evaluating overfitting and underfitting in practice, and use it to roughly divide community detection methods into general and specialized learning algorithms. Across methods and inputs, Bayesian techniques based on the stochastic block model and a minimum description length approach to regularization represent the best general learning approach, but can be outperformed under specific circumstances. These results introduce both a theoretically principled approach to evaluate over and underfitting in models of network community structure and a realistic benchmark by which new methods may be evaluated and compared.Comment: 22 pages, 13 figures, 3 table

arXiv.org e-Print Archive

Crossref