13,106 research outputs found
Spectral Graph Forge: Graph Generation Targeting Modularity
Community structure is an important property that captures inhomogeneities
common in large networks, and modularity is one of the most widely used metrics
for such community structure. In this paper, we introduce a principled
methodology, the Spectral Graph Forge, for generating random graphs that
preserves community structure from a real network of interest, in terms of
modularity. Our approach leverages the fact that the spectral structure of
matrix representations of a graph encodes global information about community
structure. The Spectral Graph Forge uses a low-rank approximation of the
modularity matrix to generate synthetic graphs that match a target modularity
within user-selectable degree of accuracy, while allowing other aspects of
structure to vary. We show that the Spectral Graph Forge outperforms
state-of-the-art techniques in terms of accuracy in targeting the modularity
and randomness of the realizations, while also preserving other local
structural properties and node attributes. We discuss extensions of the
Spectral Graph Forge to target other properties beyond modularity, and its
applications to anonymization
Towards a property graph generator for benchmarking
The use of synthetic graph generators is a common practice among
graph-oriented benchmark designers, as it allows obtaining graphs with the
required scale and characteristics. However, finding a graph generator that
accurately fits the needs of a given benchmark is very difficult, thus
practitioners end up creating ad-hoc ones. Such a task is usually
time-consuming, and often leads to reinventing the wheel. In this paper, we
introduce the conceptual design of DataSynth, a framework for property graphs
generation with customizable schemas and characteristics. The goal of DataSynth
is to assist benchmark designers in generating graphs efficiently and at scale,
saving from implementing their own generators. Additionally, DataSynth
introduces novel features barely explored so far, such as modeling the
correlation between properties and the structure of the graph. This is achieved
by a novel property-to-node matching algorithm for which we present preliminary
promising results
Private Graph Data Release: A Survey
The application of graph analytics to various domains have yielded tremendous
societal and economical benefits in recent years. However, the increasingly
widespread adoption of graph analytics comes with a commensurate increase in
the need to protect private information in graph databases, especially in light
of the many privacy breaches in real-world graph data that was supposed to
preserve sensitive information. This paper provides a comprehensive survey of
private graph data release algorithms that seek to achieve the fine balance
between privacy and utility, with a specific focus on provably private
mechanisms. Many of these mechanisms fall under natural extensions of the
Differential Privacy framework to graph data, but we also investigate more
general privacy formulations like Pufferfish Privacy that can deal with the
limitations of Differential Privacy. A wide-ranging survey of the applications
of private graph data release mechanisms to social networks, finance, supply
chain, health and energy is also provided. This survey paper and the taxonomy
it provides should benefit practitioners and researchers alike in the
increasingly important area of private graph data release and analysis
A statistical model for brain networks inferred from large-scale electrophysiological signals
Network science has been extensively developed to characterize structural
properties of complex systems, including brain networks inferred from
neuroimaging data. As a result of the inference process, networks estimated
from experimentally obtained biological data, represent one instance of a
larger number of realizations with similar intrinsic topology. A modeling
approach is therefore needed to support statistical inference on the bottom-up
local connectivity mechanisms influencing the formation of the estimated brain
networks. We adopted a statistical model based on exponential random graphs
(ERGM) to reproduce brain networks, or connectomes, estimated by spectral
coherence between high-density electroencephalographic (EEG) signals. We
validated this approach in a dataset of 108 healthy subjects during eyes-open
(EO) and eyes-closed (EC) resting-state conditions. Results showed that the
tendency to form triangles and stars, reflecting clustering and node
centrality, better explained the global properties of the EEG connectomes as
compared to other combinations of graph metrics. Synthetic networks generated
by this model configuration replicated the characteristic differences found in
brain networks, with EO eliciting significantly higher segregation in the alpha
frequency band (8-13 Hz) as compared to EC. Furthermore, the fitted ERGM
parameter values provided complementary information showing that clustering
connections are significantly more represented from EC to EO in the alpha
range, but also in the beta band (14-29 Hz), which is known to play a crucial
role in cortical processing of visual input and externally oriented attention.
These findings support the current view of the brain functional segregation and
integration in terms of modules and hubs, and provide a statistical approach to
extract new information on the (re)organizational mechanisms in healthy and
diseased brains.Comment: Due to the limitation "The abstract field cannot be longer than 1,920
characters", the abstract appearing here is slightly shorter than that in the
PDF fil
- …