2,121 research outputs found
MANCaLog: A Logic for Multi-Attribute Network Cascades (Technical Report)
The modeling of cascade processes in multi-agent systems in the form of
complex networks has in recent years become an important topic of study due to
its many applications: the adoption of commercial products, spread of disease,
the diffusion of an idea, etc. In this paper, we begin by identifying a
desiderata of seven properties that a framework for modeling such processes
should satisfy: the ability to represent attributes of both nodes and edges, an
explicit representation of time, the ability to represent non-Markovian
temporal relationships, representation of uncertain information, the ability to
represent competing cascades, allowance of non-monotonic diffusion, and
computational tractability. We then present the MANCaLog language, a formalism
based on logic programming that satisfies all these desiderata, and focus on
algorithms for finding minimal models (from which the outcome of cascades can
be obtained) as well as how this formalism can be applied in real world
scenarios. We are not aware of any other formalism in the literature that meets
all of the above requirements
Smart Algorithms for Hierarchical Clustering in Optical Network
Network design process is a very important in order to balance between the investment in the network and the supervises offered to the network user, taking into consideration, both minimizing the network investment cost, on the other hand, maximizing the quality of service offered to the customers as well.Partitioning the network to smaller sub-networks called clusters is required during the design process inorder to ease studying the whole network and achieve the design process as a trade-off between several atrtributes such as quality of service, reliability,cost, and management. Under CANON, a large scale optical network is partitioned into a number of geographically limited areas taking into account many different criteria like administrative domains, topological characteristics, traffic patterns, legacy infrastructure etc. An important consideration is that each of these clusters is comprised of a group of nodes in geographical proximity. The clusters can coincide with administrative domains but there could be many cases where two or more clusters belong to the same administrative domain. Therefore, in the most general case the partitioning into specific clusters can be either a off-line or a on-line process. In this work only the off-line case is considered. In this Study, we look at the problem of designing efficient 2- level Hierarchical Optical Networks (HON), in the context of network costs optimization. 2-level HON paradigm only have local rings to connect disjoint sets of nodes and a global sub mesh to interconnect all the local rings. We present an Hierarchical algorithm that is based on two phases. We present results for scenarios containing a set of real optical topologies
Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies
We explore the trade-offs of performing linear algebra using Apache Spark,
compared to traditional C and MPI implementations on HPC platforms. Spark is
designed for data analytics on cluster computing platforms with access to local
disks and is optimized for data-parallel tasks. We examine three widely-used
and important matrix factorizations: NMF (for physical plausability), PCA (for
its ubiquity) and CX (for data interpretability). We apply these methods to
TB-sized problems in particle physics, climate modeling and bioimaging. The
data matrices are tall-and-skinny which enable the algorithms to map
conveniently into Spark's data-parallel model. We perform scaling experiments
on up to 1600 Cray XC40 nodes, describe the sources of slowdowns, and provide
tuning guidance to obtain high performance
Describing and Understanding Neighborhood Characteristics through Online Social Media
Geotagged data can be used to describe regions in the world and discover
local themes. However, not all data produced within a region is necessarily
specifically descriptive of that area. To surface the content that is
characteristic for a region, we present the geographical hierarchy model (GHM),
a probabilistic model based on the assumption that data observed in a region is
a random mixture of content that pertains to different levels of a hierarchy.
We apply the GHM to a dataset of 8 million Flickr photos in order to
discriminate between content (i.e., tags) that specifically characterizes a
region (e.g., neighborhood) and content that characterizes surrounding areas or
more general themes. Knowledge of the discriminative and non-discriminative
terms used throughout the hierarchy enables us to quantify the uniqueness of a
given region and to compare similar but distant regions. Our evaluation
demonstrates that our model improves upon traditional Naive Bayes
classification by 47% and hierarchical TF-IDF by 27%. We further highlight the
differences and commonalities with human reasoning about what is locally
characteristic for a neighborhood, distilled from ten interviews and a survey
that covered themes such as time, events, and prior regional knowledgeComment: Accepted in WWW 2015, 2015, Florence, Ital
- …