242,061 research outputs found
Improved Heterogeneous Distance Functions
Instance-based learning techniques typically handle continuous and linear
input values well, but often do not handle nominal input attributes
appropriately. The Value Difference Metric (VDM) was designed to find
reasonable distance values between nominal attribute values, but it largely
ignores continuous attributes, requiring discretization to map continuous
values into nominal values. This paper proposes three new heterogeneous
distance functions, called the Heterogeneous Value Difference Metric (HVDM),
the Interpolated Value Difference Metric (IVDM), and the Windowed Value
Difference Metric (WVDM). These new distance functions are designed to handle
applications with nominal attributes, continuous attributes, or both. In
experiments on 48 applications the new distance metrics achieve higher
classification accuracy on average than three previous distance functions on
those datasets that have both nominal and continuous attributes.Comment: See http://www.jair.org/ for an online appendix and other files
accompanying this articl
Upscaling Polymer Flooding in Heterogeneous Reservoirs
Imperial Users onl
Diffusion Component Analysis: Unraveling Functional Topology in Biological Networks
Complex biological systems have been successfully modeled by biochemical and
genetic interaction networks, typically gathered from high-throughput (HTP)
data. These networks can be used to infer functional relationships between
genes or proteins. Using the intuition that the topological role of a gene in a
network relates to its biological function, local or diffusion based
"guilt-by-association" and graph-theoretic methods have had success in
inferring gene functions. Here we seek to improve function prediction by
integrating diffusion-based methods with a novel dimensionality reduction
technique to overcome the incomplete and noisy nature of network data. In this
paper, we introduce diffusion component analysis (DCA), a framework that plugs
in a diffusion model and learns a low-dimensional vector representation of each
node to encode the topological properties of a network. As a proof of concept,
we demonstrate DCA's substantial improvement over state-of-the-art
diffusion-based approaches in predicting protein function from molecular
interaction networks. Moreover, our DCA framework can integrate multiple
networks from heterogeneous sources, consisting of genomic information,
biochemical experiments and other resources, to even further improve function
prediction. Yet another layer of performance gain is achieved by integrating
the DCA framework with support vector machines that take our node vector
representations as features. Overall, our DCA framework provides a novel
representation of nodes in a network that can be used as a plug-in architecture
to other machine learning algorithms to decipher topological properties of and
obtain novel insights into interactomes.Comment: RECOMB 201
Optimized normal and distance matching for heterogeneous object modeling
This paper presents a new optimization methodology of material blending for heterogeneous object modeling by matching the material governing features for designing a heterogeneous object. The proposed method establishes point-to-point correspondence represented by a set of connecting lines between two material directrices. To blend the material features between the directrices, a heuristic optimization method developed with the objective is to maximize the sum of the inner products of the unit normals at the end points of the connecting lines and minimize the sum of the lengths of connecting lines. The geometric features with material information are matched to generate non-self-intersecting and non-twisted connecting surfaces. By subdividing the connecting lines into equal number of segments, a series of intermediate piecewise curves are generated to represent the material metamorphosis between the governing material features. Alternatively, a dynamic programming approach developed in our earlier work is presented for comparison purposes. Result and computational efficiency of the proposed heuristic method is also compared with earlier techniques in the literature. Computer interface implementation and illustrative examples are also presented in this paper
HetHetNets: Heterogeneous Traffic Distribution in Heterogeneous Wireless Cellular Networks
A recent approach in modeling and analysis of the supply and demand in
heterogeneous wireless cellular networks has been the use of two independent
Poisson point processes (PPPs) for the locations of base stations (BSs) and
user equipments (UEs). This popular approach has two major shortcomings. First,
although the PPP model may be a fitting one for the BS locations, it is less
adequate for the UE locations mainly due to the fact that the model is not
adjustable (tunable) to represent the severity of the heterogeneity
(non-uniformity) in the UE locations. Besides, the independence assumption
between the two PPPs does not capture the often-observed correlation between
the UE and BS locations.
This paper presents a novel heterogeneous spatial traffic modeling which
allows statistical adjustment. Simple and non-parameterized, yet sufficiently
accurate, measures for capturing the traffic characteristics in space are
introduced. Only two statistical parameters related to the UE distribution,
namely, the coefficient of variation (the normalized second-moment), of an
appropriately defined inter-UE distance measure, and correlation coefficient
(the normalized cross-moment) between UE and BS locations, are adjusted to
control the degree of heterogeneity and the bias towards the BS locations,
respectively. This model is used in heterogeneous wireless cellular networks
(HetNets) to demonstrate the impact of heterogeneous and BS-correlated traffic
on the network performance. This network is called HetHetNet since it has two
types of heterogeneity: heterogeneity in the infrastructure (supply), and
heterogeneity in the spatial traffic distribution (demand).Comment: JSA
Classification of Message Spreading in a Heterogeneous Social Network
Nowadays, social networks such as Twitter, Facebook and LinkedIn become
increasingly popular. In fact, they introduced new habits, new ways of
communication and they collect every day several information that have
different sources. Most existing research works fo-cus on the analysis of
homogeneous social networks, i.e. we have a single type of node and link in the
network. However, in the real world, social networks offer several types of
nodes and links. Hence, with a view to preserve as much information as
possible, it is important to consider so-cial networks as heterogeneous and
uncertain. The goal of our paper is to classify the social message based on its
spreading in the network and the theory of belief functions. The proposed
classifier interprets the spread of messages on the network, crossed paths and
types of links. We tested our classifier on a real word network that we
collected from Twitter, and our experiments show the performance of our belief
classifier
- …