2,121 research outputs found

    MANCaLog: A Logic for Multi-Attribute Network Cascades (Technical Report)

    Get PDF
    The modeling of cascade processes in multi-agent systems in the form of complex networks has in recent years become an important topic of study due to its many applications: the adoption of commercial products, spread of disease, the diffusion of an idea, etc. In this paper, we begin by identifying a desiderata of seven properties that a framework for modeling such processes should satisfy: the ability to represent attributes of both nodes and edges, an explicit representation of time, the ability to represent non-Markovian temporal relationships, representation of uncertain information, the ability to represent competing cascades, allowance of non-monotonic diffusion, and computational tractability. We then present the MANCaLog language, a formalism based on logic programming that satisfies all these desiderata, and focus on algorithms for finding minimal models (from which the outcome of cascades can be obtained) as well as how this formalism can be applied in real world scenarios. We are not aware of any other formalism in the literature that meets all of the above requirements

    Smart Algorithms for Hierarchical Clustering in Optical Network

    Get PDF
    Network design process is a very important in order to balance between the investment in the network and the supervises offered to the network user, taking into consideration, both minimizing the network investment cost, on the other hand, maximizing the quality of service offered to the customers as well.Partitioning the network to smaller sub-networks called clusters is required during the design process inorder to ease studying the whole network and achieve the design process as a trade-off between several atrtributes such as quality of service, reliability,cost, and management. Under CANON, a large scale optical network is partitioned into a number of geographically limited areas taking into account many different criteria like administrative domains, topological characteristics, traffic patterns, legacy infrastructure etc. An important consideration is that each of these clusters is comprised of a group of nodes in geographical proximity. The clusters can coincide with administrative domains but there could be many cases where two or more clusters belong to the same administrative domain. Therefore, in the most general case the partitioning into specific clusters can be either a off-line or a on-line process. In this work only the off-line case is considered. In this Study, we look at the problem of designing efficient 2- level Hierarchical Optical Networks (HON), in the context of network costs optimization. 2-level HON paradigm only have local rings to connect disjoint sets of nodes and a global sub mesh to interconnect all the local rings. We present an Hierarchical algorithm that is based on two phases. We present results for scenarios containing a set of real optical topologies

    Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies

    Full text link
    We explore the trade-offs of performing linear algebra using Apache Spark, compared to traditional C and MPI implementations on HPC platforms. Spark is designed for data analytics on cluster computing platforms with access to local disks and is optimized for data-parallel tasks. We examine three widely-used and important matrix factorizations: NMF (for physical plausability), PCA (for its ubiquity) and CX (for data interpretability). We apply these methods to TB-sized problems in particle physics, climate modeling and bioimaging. The data matrices are tall-and-skinny which enable the algorithms to map conveniently into Spark's data-parallel model. We perform scaling experiments on up to 1600 Cray XC40 nodes, describe the sources of slowdowns, and provide tuning guidance to obtain high performance

    Describing and Understanding Neighborhood Characteristics through Online Social Media

    Full text link
    Geotagged data can be used to describe regions in the world and discover local themes. However, not all data produced within a region is necessarily specifically descriptive of that area. To surface the content that is characteristic for a region, we present the geographical hierarchy model (GHM), a probabilistic model based on the assumption that data observed in a region is a random mixture of content that pertains to different levels of a hierarchy. We apply the GHM to a dataset of 8 million Flickr photos in order to discriminate between content (i.e., tags) that specifically characterizes a region (e.g., neighborhood) and content that characterizes surrounding areas or more general themes. Knowledge of the discriminative and non-discriminative terms used throughout the hierarchy enables us to quantify the uniqueness of a given region and to compare similar but distant regions. Our evaluation demonstrates that our model improves upon traditional Naive Bayes classification by 47% and hierarchical TF-IDF by 27%. We further highlight the differences and commonalities with human reasoning about what is locally characteristic for a neighborhood, distilled from ten interviews and a survey that covered themes such as time, events, and prior regional knowledgeComment: Accepted in WWW 2015, 2015, Florence, Ital
    • …
    corecore