19 research outputs found
Model Selection in Overlapping Stochastic Block Models
Networks are a commonly used mathematical model to describe the rich set of
interactions between objects of interest. Many clustering methods have been
developed in order to partition such structures, among which several rely on
underlying probabilistic models, typically mixture models. The relevant hidden
structure may however show overlapping groups in several applications. The
Overlapping Stochastic Block Model (2011) has been developed to take this
phenomenon into account. Nevertheless, the problem of the choice of the number
of classes in the inference step is still open. To tackle this issue, we
consider the proposed model in a Bayesian framework and develop a new criterion
based on a non asymptotic approximation of the marginal log-likelihood. We
describe how the criterion can be computed through a variational Bayes EM
algorithm, and demonstrate its efficiency by running it on both simulated and
real data.Comment: articl
Packing and covering immersion models of planar subcubic graphs
A graph is an immersion of a graph if can be obtained by some
sugraph after lifting incident edges. We prove that there is a polynomial
function , such that if is a
connected planar subcubic graph on edges, is a graph, and is a
non-negative integer, then either contains vertex/edge-disjoint
subgraphs, each containing as an immersion, or contains a set of
vertices/edges such that does not contain as an
immersion
On Bubble Generators in Directed Graphs
International audienceBubbles are pairs of internally vertex-disjoint (s, t)-paths with applications in the processing of DNA and RNA data. For example, enumerating alternative splicing events in a reference-free context can be done by enumerating all bubbles in a de Bruijn graph built from RNA-seq reads [16]. However, listing and analysing all bubbles in a given graph is usually unfeasible in practice, due to the exponential number of bubbles present in real data graphs. In this paper, we propose a notion of a bubble generator set, i.e. a polynomial-sized subset of bubbles from which all the others can be obtained through the application of a specific symmetric difference operator. This set provides a compact representation of the bubble space of a graph, which can be useful in practice since some pertinent information about all the bubbles can be more conveniently extracted from this compact set. Furthermore, we provide a polynomial-time algorithm to decompose any bubble of a graph into the bubbles of such a generator in a tree-like fashion
A Family of Tree-Based Generators for Bubbles in Directed Graphs
International audienceBubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [1] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on the previous generator: generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new generators, we also report some experimental results on a real dataset