1,029 research outputs found

    A review of domain adaptation without target labels

    Full text link
    Domain adaptation has become a prominent problem setting in machine learning and related fields. This review asks the question: how can a classifier learn from a source domain and generalize to a target domain? We present a categorization of approaches, divided into, what we refer to as, sample-based, feature-based and inference-based methods. Sample-based methods focus on weighting individual observations during training based on their importance to the target domain. Feature-based methods revolve around on mapping, projecting and representing features such that a source classifier performs well on the target domain and inference-based methods incorporate adaptation into the parameter estimation procedure, for instance through constraints on the optimization procedure. Additionally, we review a number of conditions that allow for formulating bounds on the cross-domain generalization error. Our categorization highlights recurring ideas and raises questions important to further research.Comment: 20 pages, 5 figure

    Detection of an anomalous cluster in a network

    Full text link
    We consider the problem of detecting whether or not, in a given sensor network, there is a cluster of sensors which exhibit an "unusual behavior." Formally, suppose we are given a set of nodes and attach a random variable to each node. We observe a realization of this process and want to decide between the following two hypotheses: under the null, the variables are i.i.d. standard normal; under the alternative, there is a cluster of variables that are i.i.d. normal with positive mean and unit variance, while the rest are i.i.d. standard normal. We also address surveillance settings where each sensor in the network collects information over time. The resulting model is similar, now with a time series attached to each node. We again observe the process over time and want to decide between the null, where all the variables are i.i.d. standard normal, and the alternative, where there is an emerging cluster of i.i.d. normal variables with positive mean and unit variance. The growth models used to represent the emerging cluster are quite general and, in particular, include cellular automata used in modeling epidemics. In both settings, we consider classes of clusters that are quite general, for which we obtain a lower bound on their respective minimax detection rate and show that some form of scan statistic, by far the most popular method in practice, achieves that same rate to within a logarithmic factor. Our results are not limited to the normal location model, but generalize to any one-parameter exponential family when the anomalous clusters are large enough.Comment: Published in at http://dx.doi.org/10.1214/10-AOS839 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Persistence of the Jordan center in Random Growing Trees

    Full text link
    The Jordan center of a graph is defined as a vertex whose maximum distance to other nodes in the graph is minimal, and it finds applications in facility location and source detection problems. We study properties of the Jordan Center in the case of random growing trees. In particular, we consider a regular tree graph on which an infection starts from a root node and then spreads along the edges of the graph according to various random spread models. For the Independent Cascade (IC) model and the discrete Susceptible Infected (SI) model, both of which are discrete time models, we show that as the infected subgraph grows with time, the Jordan center persists on a single vertex after a finite number of timesteps. Finally, we also study the continuous time version of the SI model and bound the maximum distance between the Jordan center and the root node at any time.Comment: 28 pages, 14 figure

    Minimax network location : theory and algorithms

    Get PDF
    Originally presented as the author's Ph. D. thesis, M.I.T. Dept. of Aeronautics and Astronautics, 1974August 1974Includes bibliographical references (leaves 122-126)For a given network let P and N denote the set of all points and the set of all nodes respectively. Let G and T denote a cyclic network and a tree network respectively and let m denote the number of centers available. The categorization scheme P N/P N/m/G T, where the first and second cells refer to the possible locations of centers and demand generating points respectively, provides for compact identification of a variety of minimax network location problems. This dissertation presents algorithms which efficiently solve all problems in this class--for example, P/P/m/G-for virtually any size of network. Moreover, tree problems can usually be solved manually. Methodologically, the tree-based results are graph-theoretic while the general case, formulated in a mathematical programming framework, leads to a highly efficient strategy for a class of massive generalized set covering problems

    The optimal location of facilities on a network

    Get PDF
    Imperial Users onl

    A review of network location theory and models

    Get PDF
    Cataloged from PDF version of article.In this study, we review the existing literature on network location problems. The study has a broad scope that includes problems featuring desirable and undesirable facilities, point facilities and extensive facilities, monopolistic and competitive markets, and single or multiple objectives. Deterministic and stochastic models as well as robust models are covered. Demand data aggregation is also discussed. More than 500 papers in this area are reviewed and critical issues, research directions, and problem extensions are emphasized.Erdoğan, Damla SelinM.S

    Applications of Optimization Under Uncertainty Methods on Power System Planning Problems

    Get PDF
    This dissertation consists of two published journal paper, both on transmission expansion planning, and a report on distribution network hardening. We first discuss our studies of two optimization criteria for the transmission planning problem with a simplified representation of load and the forecast generation investment additions within the robust optimization paradigm. The objective is to determine either the minimum of the maximum investment requirement or the maximum regret with all sources of uncertainty explicitly represented. In this way, transmission planners can determine optimal planning decisions that are robust against all sources of uncertainty. We use a two layer algorithm to solve the resulting trilevel optimization problems. We also construct a new robust transmission planning model that considers generation investment more realistically to improve the quantification and visualization of uncertainty and the impacts of environmental policies. With this model, we can explore the effect of uncertainty in both the size and the location of candidate generation additions. The corresponding algorithm we develop takes advantage of the structural characteristics of the model so as to obtain a computationally efficient methodology. The two robust optimization tools provide new capabilities to transmission planners for the development of strategies that explicitly account for various sources of uncertainty. We illustrate the application of the two optimization models and solution schemes on a set of representative case studies. These studies give a good idea of the usefulness of these tools and show their practical worth in the assessment of ``what if\u27\u27 cases. We compare the performance of the minimax cost approach and the minimax regret approach under different characterizations of uncertain parameters. In addition, we also present extensive numerical studies on an IEEE 118-bus test system and the WECC 240-bus system to illustrate the effectiveness of the proposed decision support methods. The case study results are particularly useful to understand the impacts of each individual investment plan on the power system\u27s overall transmission adequacy in meeting the demand of the trade with the power output units without violation of the physical limits of the grid. In the report on distribution network hardening, a two-stage stochastic optimization model is proposed. Transmission and distribution networks are essential infrastructures to modern society. In the United States alone, there are there are more than 200,000 miles of high voltage transmission lines and numerous distribution lines. The power network spans the whole country. Such vast networks are vulnerable to disruptions caused by natural disasters. Hardening of distribution lines could significantly reduce the impact of natural disasters on the operation of power systems. However, due to the limited budget, it is impossible to upgrade the whole power network. Thus, intelligent allocation of resources is crucial. Optimal allocation of limited budget between different hardening methods on different distribution lines is explored
    corecore