1,029 research outputs found
A review of domain adaptation without target labels
Domain adaptation has become a prominent problem setting in machine learning
and related fields. This review asks the question: how can a classifier learn
from a source domain and generalize to a target domain? We present a
categorization of approaches, divided into, what we refer to as, sample-based,
feature-based and inference-based methods. Sample-based methods focus on
weighting individual observations during training based on their importance to
the target domain. Feature-based methods revolve around on mapping, projecting
and representing features such that a source classifier performs well on the
target domain and inference-based methods incorporate adaptation into the
parameter estimation procedure, for instance through constraints on the
optimization procedure. Additionally, we review a number of conditions that
allow for formulating bounds on the cross-domain generalization error. Our
categorization highlights recurring ideas and raises questions important to
further research.Comment: 20 pages, 5 figure
Detection of an anomalous cluster in a network
We consider the problem of detecting whether or not, in a given sensor
network, there is a cluster of sensors which exhibit an "unusual behavior."
Formally, suppose we are given a set of nodes and attach a random variable to
each node. We observe a realization of this process and want to decide between
the following two hypotheses: under the null, the variables are i.i.d. standard
normal; under the alternative, there is a cluster of variables that are i.i.d.
normal with positive mean and unit variance, while the rest are i.i.d. standard
normal. We also address surveillance settings where each sensor in the network
collects information over time. The resulting model is similar, now with a time
series attached to each node. We again observe the process over time and want
to decide between the null, where all the variables are i.i.d. standard normal,
and the alternative, where there is an emerging cluster of i.i.d. normal
variables with positive mean and unit variance. The growth models used to
represent the emerging cluster are quite general and, in particular, include
cellular automata used in modeling epidemics. In both settings, we consider
classes of clusters that are quite general, for which we obtain a lower bound
on their respective minimax detection rate and show that some form of scan
statistic, by far the most popular method in practice, achieves that same rate
to within a logarithmic factor. Our results are not limited to the normal
location model, but generalize to any one-parameter exponential family when the
anomalous clusters are large enough.Comment: Published in at http://dx.doi.org/10.1214/10-AOS839 the Annals of
Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical
Statistics (http://www.imstat.org
Persistence of the Jordan center in Random Growing Trees
The Jordan center of a graph is defined as a vertex whose maximum distance to
other nodes in the graph is minimal, and it finds applications in facility
location and source detection problems. We study properties of the Jordan
Center in the case of random growing trees. In particular, we consider a
regular tree graph on which an infection starts from a root node and then
spreads along the edges of the graph according to various random spread models.
For the Independent Cascade (IC) model and the discrete Susceptible Infected
(SI) model, both of which are discrete time models, we show that as the
infected subgraph grows with time, the Jordan center persists on a single
vertex after a finite number of timesteps. Finally, we also study the
continuous time version of the SI model and bound the maximum distance between
the Jordan center and the root node at any time.Comment: 28 pages, 14 figure
Minimax network location : theory and algorithms
Originally presented as the author's Ph. D. thesis, M.I.T. Dept. of Aeronautics and Astronautics, 1974August 1974Includes bibliographical references (leaves 122-126)For a given network let P and N denote the set of all points and the set of all nodes respectively. Let G and T denote a cyclic network and a tree network respectively and let m denote the number of centers available. The categorization scheme P N/P N/m/G T, where the first and second cells refer to the possible locations of centers and demand generating points respectively, provides for compact identification of a variety of minimax network location problems. This dissertation presents algorithms which efficiently solve all problems in this class--for example, P/P/m/G-for virtually any size of network. Moreover, tree problems can usually be solved manually. Methodologically, the tree-based results are graph-theoretic while the general case, formulated in a mathematical programming framework, leads to a highly efficient strategy for a class of massive generalized set covering problems
The optimal location of facilities on a network
Imperial Users onl
A review of network location theory and models
Cataloged from PDF version of article.In this study, we review the existing literature on network location problems.
The study has a broad scope that includes problems featuring desirable and
undesirable facilities, point facilities and extensive facilities, monopolistic and
competitive markets, and single or multiple objectives. Deterministic and
stochastic models as well as robust models are covered. Demand data
aggregation is also discussed. More than 500 papers in this area are reviewed
and critical issues, research directions, and problem extensions are emphasized.Erdoğan, Damla SelinM.S
Applications of Optimization Under Uncertainty Methods on Power System Planning Problems
This dissertation consists of two published journal paper, both on transmission expansion planning, and a report on distribution network hardening.
We first discuss our studies of two optimization criteria for the transmission planning problem with a simplified representation of load and the forecast generation investment additions within the robust optimization paradigm. The objective is to determine either the minimum of the maximum investment requirement or the maximum regret with all sources of uncertainty explicitly represented. In this way, transmission planners can determine optimal planning decisions that are robust against all sources of uncertainty. We use a two layer algorithm to solve the resulting trilevel optimization problems. We also construct a new robust transmission planning model that considers generation investment more realistically to improve the quantification and visualization of uncertainty and the impacts of environmental policies. With this model, we can explore the effect of uncertainty in both the size and the location of candidate generation additions. The corresponding algorithm we develop takes advantage of the structural characteristics of the model so as to obtain a computationally efficient methodology. The two robust optimization tools provide new capabilities to transmission planners for the development of strategies that explicitly account for various sources of uncertainty.
We illustrate the application of the two optimization models and solution schemes on a set of representative case studies. These studies give a good idea of the usefulness of these tools and show their practical worth in the assessment of ``what if\u27\u27 cases. We compare the performance of the minimax cost approach and the minimax regret approach under different characterizations of uncertain parameters. In addition, we also present extensive numerical studies on an IEEE 118-bus test system and the WECC 240-bus system to illustrate the effectiveness of the proposed decision support methods. The case study results are particularly useful to understand the impacts of each individual investment plan on the power system\u27s overall transmission adequacy in meeting the demand of the trade with the power output units without violation of the physical limits of the grid.
In the report on distribution network hardening, a two-stage stochastic optimization model is proposed. Transmission and distribution networks are essential infrastructures to modern society. In the United States alone, there are there are more than 200,000 miles of high voltage transmission lines and numerous distribution lines. The power network spans the whole country. Such vast networks are vulnerable to disruptions caused by natural disasters. Hardening of distribution lines could significantly reduce the impact of natural disasters on the operation of power systems. However, due to the limited budget, it is impossible to upgrade the whole power network. Thus, intelligent allocation of resources is crucial. Optimal allocation of limited budget between different hardening methods on different distribution lines is explored
- …