23 research outputs found

    Link Graph Analysis for Adult Images Classification

    Full text link
    In order to protect an image search engine's users from undesirable results adult images' classifier should be built. The information about links from websites to images is employed to create such a classifier. These links are represented as a bipartite website-image graph. Each vertex is equipped with scores of adultness and decentness. The scores for image vertexes are initialized with zero, those for website vertexes are initialized according to a text-based website classifier. An iterative algorithm that propagates scores within a website-image graph is described. The scores obtained are used to classify images by choosing an appropriate threshold. The experiments on Internet-scale data have shown that the algorithm under consideration increases classification recall by 17% in comparison with a simple algorithm which classifies an image as adult if it is connected with at least one adult site (at the same precision level).Comment: 7 pages. Young Scientists Conference, 4th Russian Summer School in Information Retrieva

    Network Capacity Bound for Personalized PageRank in Multimodal Networks

    Full text link
    In a former paper the concept of Bipartite PageRank was introduced and a theorem on the limit of authority flowing between nodes for personalized PageRank has been generalized. In this paper we want to extend those results to multimodal networks. In particular we introduce a hypergraph type that may be used for describing multimodal network where a hyperlink connects nodes from each of the modalities. We introduce a generalisation of PageRank for such graphs and define the respective random walk model that can be used for computations. we finally state and prove theorems on the limit of outflow of authority for cases where individual modalities have identical and distinct damping factors.Comment: 28 pages. arXiv admin note: text overlap with arXiv:1702.0373

    Relevance Search via Bipolar Label Diffusion on Bipartite Graphs

    Get PDF
    The task of relevance search is to find relevant items to some given queries which can be viewed either as an information retrieval problem or as a semi-supervised learning problem In order to combine both of their advantages we develop a new relevance search method using label diffusion on bipartite graphs And we propose a heat diffusion-based algorithm namely bipartite label diffusion BLD Our method yields encouraging experimental results on a number of relevance search problem

    FLEET: Butterfly Estimation from a Bipartite Graph Stream

    Full text link
    We consider space-efficient single-pass estimation of the number of butterflies, a fundamental bipartite graph motif, from a massive bipartite graph stream where each edge represents a connection between entities in two different partitions. We present a space lower bound for any streaming algorithm that can estimate the number of butterflies accurately, as well as FLEET, a suite of algorithms for accurately estimating the number of butterflies in the graph stream. Estimates returned by the algorithms come with provable guarantees on the approximation error, and experiments show good tradeoffs between the space used and the accuracy of approximation. We also present space-efficient algorithms for estimating the number of butterflies within a sliding window of the most recent elements in the stream. While there is a significant body of work on counting subgraphs such as triangles in a unipartite graph stream, our work seems to be one of the few to tackle the case of bipartite graph streams.Comment: This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Seyed-Vahid Sanei-Mehri, Yu Zhang, Ahmet Erdem Sariyuce and Srikanta Tirthapura. "FLEET: Butterfly Estimation from a Bipartite Graph Stream". The 28th ACM International Conference on Information and Knowledge Managemen

    University-Industry Technology Transfer: Empirical Findings from Chinese Industrial Firms

    Get PDF
    The knowledge and innovation generated by researchers at universities is transferred to industries through patent licensing, leading to the commercialization of academic output. In order to investigate the development of Chinese university-industry technology transfer and whether this kind of collaboration may affect a firm's innovation output, we collected approximately 6400 license contracts made between more than 4000 Chinese firms and 300 Chinese universities for the period between 2009 and 2014. This is the first study on Chinese university-industry knowledge transfer using a bipartite social network analysis (SNA) method, which emphasizes centrality estimates. We are able to investigate empirically how patent license transfer behavior may affect each firm's innovative output by allocating a centrality score to each firm in the university-firm technology transfer network. We elucidate the academic-industry knowledge by visualizing flow patterns for different regions with the SNA tool, Gephi. We find that innovation capabilities, R&D resources, and technology transfer performance all vary across China, and that patent licensing networks present clear small-world phenomena. We also highlight the Bipartite Graph Reinforcement Model (BGRM) and BiRank centrality in the bipartite network. Our empirical results reveal that firms with high BGRM and BiRank centrality scores, long history, and fewer employees have greater innovative output
    corecore