5,981 research outputs found
Name Disambiguation from link data in a collaboration graph using temporal and topological features
In a social community, multiple persons may share the same name, phone number
or some other identifying attributes. This, along with other phenomena, such as
name abbreviation, name misspelling, and human error leads to erroneous
aggregation of records of multiple persons under a single reference. Such
mistakes affect the performance of document retrieval, web search, database
integration, and more importantly, improper attribution of credit (or blame).
The task of entity disambiguation partitions the records belonging to multiple
persons with the objective that each decomposed partition is composed of
records of a unique person. Existing solutions to this task use either
biographical attributes, or auxiliary features that are collected from external
sources, such as Wikipedia. However, for many scenarios, such auxiliary
features are not available, or they are costly to obtain. Besides, the attempt
of collecting biographical or external data sustains the risk of privacy
violation. In this work, we propose a method for solving entity disambiguation
task from link information obtained from a collaboration network. Our method is
non-intrusive of privacy as it uses only the time-stamped graph topology of an
anonymized network. Experimental results on two real-life academic
collaboration networks show that the proposed method has satisfactory
performance.Comment: The short version of this paper has been accepted to ASONAM 201
Machine Learning in Wireless Sensor Networks: Algorithms, Strategies, and Applications
Wireless sensor networks monitor dynamic environments that change rapidly
over time. This dynamic behavior is either caused by external factors or
initiated by the system designers themselves. To adapt to such conditions,
sensor networks often adopt machine learning techniques to eliminate the need
for unnecessary redesign. Machine learning also inspires many practical
solutions that maximize resource utilization and prolong the lifespan of the
network. In this paper, we present an extensive literature review over the
period 2002-2013 of machine learning methods that were used to address common
issues in wireless sensor networks (WSNs). The advantages and disadvantages of
each proposed algorithm are evaluated against the corresponding problem. We
also provide a comparative guide to aid WSN designers in developing suitable
machine learning solutions for their specific application challenges.Comment: Accepted for publication in IEEE Communications Surveys and Tutorial
Testing the Martingale Difference Hypothesis Using Neural Network Approximations
The martingale difference restriction is an outcome of many theoretical analyses in economics and finance. A large body of econometric literature deals with tests of that restriction. We provide new tests based on radial basis function neural networks. Our work is based on the test design of Blake and Kapetanios (2000, 2003a,b). However, unlike that work we can provide a formal theoretical justification for the validity of these tests using approximation results from Kapetanios and Blake (2007). These results take advantage of the link between the algorithms of Blake and Kapetanios (2000, 2003a,b) and boosting. We carry out a Monte Carlo study of the properties of the new tests and find that they have superior power performance to all existing tests of the martingale difference hypothesis we consider. An empirical application to the S&P500 constituents illustrates the usefulness of our new test.Martingale difference hypothesis, Neural networks, Boosting
A distributed optimization framework for localization and formation control: applications to vision-based measurements
Multiagent systems have been a major area of research for the last 15 years. This interest has been motivated by tasks that can be executed more rapidly in a collaborative manner or that are nearly impossible to carry out otherwise. To be effective, the agents need to have the notion of a common goal shared by the entire network (for instance, a desired formation) and individual control laws to realize the goal. The common goal is typically centralized, in the sense that it involves the state of all the agents at the same time. On the other hand, it is often desirable to have individual control laws that are distributed, in the sense that the desired action of an agent depends only on the measurements and states available at the node and at a small number of neighbors. This is an attractive quality because it implies an overall system that is modular and intrinsically more robust to communication delays and node failures
Active network management for electrical distribution systems: problem formulation, benchmark, and approximate solution
With the increasing share of renewable and distributed generation in
electrical distribution systems, Active Network Management (ANM) becomes a
valuable option for a distribution system operator to operate his system in a
secure and cost-effective way without relying solely on network reinforcement.
ANM strategies are short-term policies that control the power injected by
generators and/or taken off by loads in order to avoid congestion or voltage
issues. Advanced ANM strategies imply that the system operator has to solve
large-scale optimal sequential decision-making problems under uncertainty. For
example, decisions taken at a given moment constrain the future decisions that
can be taken and uncertainty must be explicitly accounted for because neither
demand nor generation can be accurately forecasted. We first formulate the ANM
problem, which in addition to be sequential and uncertain, has a nonlinear
nature stemming from the power flow equations and a discrete nature arising
from the activation of power modulation signals. This ANM problem is then cast
as a stochastic mixed-integer nonlinear program, as well as second-order cone
and linear counterparts, for which we provide quantitative results using state
of the art solvers and perform a sensitivity analysis over the size of the
system, the amount of available flexibility, and the number of scenarios
considered in the deterministic equivalent of the stochastic program. To foster
further research on this problem, we make available at
http://www.montefiore.ulg.ac.be/~anm/ three test beds based on distribution
networks of 5, 33, and 77 buses. These test beds contain a simulator of the
distribution system, with stochastic models for the generation and consumption
devices, and callbacks to implement and test various ANM strategies
- …