25 research outputs found

    Upper tails and independence polynomials in random graphs

    Full text link
    The upper tail problem in the Erd\H{o}s--R\'enyi random graph GGn,pG\sim\mathcal{G}_{n,p} asks to estimate the probability that the number of copies of a graph HH in GG exceeds its expectation by a factor 1+δ1+\delta. Chatterjee and Dembo showed that in the sparse regime of p0p\to 0 as nn\to\infty with pnαp \geq n^{-\alpha} for an explicit α=αH>0\alpha=\alpha_H>0, this problem reduces to a natural variational problem on weighted graphs, which was thereafter asymptotically solved by two of the authors in the case where HH is a clique. Here we extend the latter work to any fixed graph HH and determine a function cH(δ)c_H(\delta) such that, for pp as above and any fixed δ>0\delta>0, the upper tail probability is exp[(cH(δ)+o(1))n2pΔlog(1/p)]\exp[-(c_H(\delta)+o(1))n^2 p^\Delta \log(1/p)], where Δ\Delta is the maximum degree of HH. As it turns out, the leading order constant in the large deviation rate function, cH(δ)c_H(\delta), is governed by the independence polynomial of HH, defined as PH(x)=iH(k)xkP_H(x)=\sum i_H(k) x^k where iH(k)i_H(k) is the number of independent sets of size kk in HH. For instance, if HH is a regular graph on mm vertices, then cH(δ)c_H(\delta) is the minimum between 12δ2/m\frac12 \delta^{2/m} and the unique positive solution of PH(x)=1+δP_H(x) = 1+\delta

    Bounds for the Generalized Distance Eigenvalues of a Graph

    Get PDF
    Let G be a simple undirected graph containing n vertices. Assume G is connected. Let D(G) be the distance matrix, DL(G) be the distance Laplacian, DQ(G) be the distance signless Laplacian, and Tr(G) be the diagonal matrix of the vertex transmissions, respectively. Furthermore, we denote by Dα(G) the generalized distance matrix, i.e., Dα(G)=αTr(G)+(1−α)D(G) , where α∈[0,1] . In this paper, we establish some new sharp bounds for the generalized distance spectral radius of G, making use of some graph parameters like the order n, the diameter, the minimum degree, the second minimum degree, the transmission degree, the second transmission degree and the parameter α , improving some bounds recently given in the literature. We also characterize the extremal graphs attaining these bounds. As an special cases of our results, we will be able to cover some of the bounds recently given in the literature for the case of distance matrix and distance signless Laplacian matrix. We also obtain new bounds for the k-th generalized distance eigenvalue

    Business Decision Insight With Causal Bayesian Networks

    Get PDF
    Causal Bayesian Networks are a widely recognised tool for modelling the uncer- tainty of a wide range of processes, particularly when the nature of how different factors influence each other. The practice of utilising Causal Bayesian Network is now becoming a growing trend for business that want to fully understand the demands im- posed on them, and how best to adapt their business in order to be successful. When designing and building a Causal Bayesian Network, it is often necessary to consult with domain experts for information about the shape of the model but also the defini- tion of how the causal factors influence others. The definition of these influences can require the specification of a large volume of probability distributions, even if a lot of evidential data is available for analysis. Whilst the definition of the structure of the model can be a relatively simple task for a domain expert, providing the probability distributions is a much more difficult task. In this thesis I discuss a method whereby, given a model structure, a domain expert can provide simple descriptive meta-data so that a hypothetical probability distribution can be generated for the discrete model variables

    Extending graph homomorphism and simulation for real life graph matching

    Get PDF
    Among the vital problems in a variety of emerging applications is the graph matching problem, which is to determine whether two graphs are similar, and if so, find all the valid matches in one graph for the other, based on specified metrics. Traditional graph matching approaches are mostly based on graph homomorphism and isomorphism, falling short of capturing both structural and semantic similarity in real life applications. Moreover, it is preferable while difficult to find all matches with high accuracy over complex graphs. Worse still, the graph structures in real life applications constantly bear modifications. In response to these challenges, this thesis presents a series of approaches for ef?ciently solving graph matching problems, over both static and dynamic real life graphs. Firstly, the thesis extends graph homomorphism and subgraph isomorphism, respectively, by mapping edges from one graph to paths in another, and by measuring the semantic similarity of nodes. The graph similarity is then measured by the metrics based on these extensions. Several optimization problems for graph matching based on the new metrics are studied, with approximation algorithms having provable guarantees on match quality developed. Secondly, although being extended in the above work, graph matching is defined in terms of functions, which cannot capture more meaningful matches and is usually hard to compute. In response to this, the thesis proposes a class of graph patterns, in which an edge denotes the connectivity in a data graph within a predefined number of hops. In addition, the thesis defines graph pattern matching based on a notion of bounded simulation relation, an extension of graph simulation. With this revision, graph pattern matching is in cubic-time by providing such an algorithm, rather than intractable. Thirdly, real life graphs often bear multiple edge types. In response to this, the thesis further extends and generalizes the proposed revisions of graph simulation to a more powerful case: a novel set of reachability queries and graph pattern queries, constrained by a subclass of regular path expressions. Several fundamental problems of the queries are studied: containment, equivalence and minimization. The enriched reachability query does not increase the complexity of the above problems, shown by the corresponding algorithms. Moreover, graph pattern queries can be evaluated in cubic time, where two such algorithms are proposed. Finally, real life graphs are frequently updated with small changes. The thesis investigates incremental algorithms for graph pattern matching defined in terms of graph simulation, bounded simulation and subgraph isomorphism. Besides studying the results on the complexity bounds, the thesis provides the experimental study verifying that these incremental algorithms significantly outperform their batch counterparts in response to small changes, using real-life data and synthetic data
    corecore