5,595 research outputs found

    Mass Volume Curves and Anomaly Ranking

    Full text link
    This paper aims at formulating the issue of ranking multivariate unlabeled observations depending on their degree of abnormality as an unsupervised statistical learning task. In the 1-d situation, this problem is usually tackled by means of tail estimation techniques: univariate observations are viewed as all the more `abnormal' as they are located far in the tail(s) of the underlying probability distribution. It would be desirable as well to dispose of a scalar valued `scoring' function allowing for comparing the degree of abnormality of multivariate observations. Here we formulate the issue of scoring anomalies as a M-estimation problem by means of a novel functional performance criterion, referred to as the Mass Volume curve (MV curve in short), whose optimal elements are strictly increasing transforms of the density almost everywhere on the support of the density. We first study the statistical estimation of the MV curve of a given scoring function and we provide a strategy to build confidence regions using a smoothed bootstrap approach. Optimization of this functional criterion over the set of piecewise constant scoring functions is next tackled. This boils down to estimating a sequence of empirical minimum volume sets whose levels are chosen adaptively from the data, so as to adjust to the variations of the optimal MV curve, while controling the bias of its approximation by a stepwise curve. Generalization bounds are then established for the difference in sup norm between the MV curve of the empirical scoring function thus obtained and the optimal MV curve

    Redistribution Through Public Employment: The Case of Italy

    Get PDF
    This paper examines the regional distribution of public employment in Italy. It documents two facts. The first is that public employment is used as a subsidy from the North to the less wealthy South. About half of the wage bill in the South of Italy can be identified as a subsidy. Both the size of public employment and the level of wages are used as a redistributive device. The second fact concerns the effects of subsidized public employment on individuals' attitudes toward job search, education, "risk taking" activities, and so on. Public employment discourages the development of market activities in the South. Copyright 2002, International Monetary Fund

    CATMAID: collaborative annotation toolkit for massive amounts of image data

    Get PDF
    Summary: High-resolution, three-dimensional (3D) imaging of large biological specimens generates massive image datasets that are difficult to navigate, annotate and share effectively. Inspired by online mapping applications like GoogleMapsâ„¢, we developed a decentralized web interface that allows seamless navigation of arbitrarily large image stacks. Our interface provides means for online, collaborative annotation of the biological image data and seamless sharing of regions of interest by bookmarking. The CATMAID interface enables synchronized navigation through multiple registered datasets even at vastly different scales such as in comparisons between optical and electron microscopy

    The interface of gravity and quantum mechanics illuminated by Wigner phase space

    Full text link
    We provide an introduction into the formulation of non-relativistic quantum mechanics using the Wigner phase-space distribution function and apply this concept to two physical situations at the interface of quantum theory and general relativity: (i) the motion of an ensemble of cold atoms relevant to tests of the weak equivalence principle, and (ii) the Kasevich-Chu interferometer. In order to lay the foundations for this analysis we first present a representation-free description of the Kasevich-Chu interferometer based on unitary operators.Comment: 69 pages, 6 figures, minor changes to match the published version. The original publication is available at http://en.sif.it/books/series/proceedings_fermi or http://ebooks.iospress.nl/volumearticle/3809

    On Zero-One and Convergence Laws for Graphs Embeddable on a Fixed Surface

    Get PDF
    We show that for no surface except for the plane does monadic second-order logic (MSO) have a zero-one-law - and not even a convergence law - on the class of (connected) graphs embeddable on the surface. In addition we show that every rational in [0,1] is the limiting probability of some MSO formula. This strongly refutes a conjecture by Heinig et al. (2014) who proved a convergence law for planar graphs, and a zero-one law for connected planar graphs, and also identified the so-called gaps of [0,1]: the subintervals that are not limiting probabilities of any MSO formula. The proof relies on a combination of methods from structural graph theory, especially large face-width embeddings of graphs on surfaces, analytic combinatorics, and finite model theory, and several parts of the proof may be of independent interest. In particular, we identify precisely the properties that make the zero-one law work on planar graphs but fail for every other surface

    Automated Design of a Correction Dipole Magnet for LHC

    Get PDF
    A correction dipole magnet, with a horizontal dipole nested inside a vertical dipole has been designed and optimized linking together different electromagnetic software and CAD/CAM systems. The necessary interfaces have recently been established in the program ROXIE which has been developed at CERN for the automatic generation and optimization of superconducting coil geometries. The program provides, in addition to a mathematical optimization chest, interfaces to commercial electromagnetic and structural software packages, CAD/CAM and databases. The results from electromagnetic calculations with different programs have been compared. Some modelling considerations to reduce the computation time are also given

    The Single Row Routing Problem Revisited: A Solution Based on Genetic Algorithms

    Get PDF
    With the advent of VLSI technology, circuits with more than one million transistors have been integrated onto a single chip. As the complexity of ICs grows, the time and money spent on designing the circuits become more important. A large, often dominant, part of the cost and time required to design an IC is consumed in the routing operation. The routing of carriers, such as in IC chips and printed circuit boards, is a classical problem in Computer Aided Design. With the complexity inherent in VLSI circuits, high performance routers are necessary. In this paper, a crucial step in the channel routing technique, the single row routing (SRR) problem, is considered. First, we discuss the relevance of SRR in the context of the general routing problem. Secondly, we show that heuristic algorithms are far from solving the general problem. Next, we introduce evolutionary computation, and, in particular, genetic algorithms (GAs) as a justifiable method in solving the SRR problem. Finally, an efficient O(nk) complexity technique based on GAs heuristic is obtained to solve the general SRR problem containing n nodes. Experimental results show that the algorithm is faster and can often generate better results than many of the leading heuristics proposed in the literature

    Genome-wide DNA polymorphism analyses using VariScan

    Get PDF
    BACKGROUND: DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data
    • …
    corecore