5,595 research outputs found
Mass Volume Curves and Anomaly Ranking
This paper aims at formulating the issue of ranking multivariate unlabeled
observations depending on their degree of abnormality as an unsupervised
statistical learning task. In the 1-d situation, this problem is usually
tackled by means of tail estimation techniques: univariate observations are
viewed as all the more `abnormal' as they are located far in the tail(s) of the
underlying probability distribution. It would be desirable as well to dispose
of a scalar valued `scoring' function allowing for comparing the degree of
abnormality of multivariate observations. Here we formulate the issue of
scoring anomalies as a M-estimation problem by means of a novel functional
performance criterion, referred to as the Mass Volume curve (MV curve in
short), whose optimal elements are strictly increasing transforms of the
density almost everywhere on the support of the density. We first study the
statistical estimation of the MV curve of a given scoring function and we
provide a strategy to build confidence regions using a smoothed bootstrap
approach. Optimization of this functional criterion over the set of piecewise
constant scoring functions is next tackled. This boils down to estimating a
sequence of empirical minimum volume sets whose levels are chosen adaptively
from the data, so as to adjust to the variations of the optimal MV curve, while
controling the bias of its approximation by a stepwise curve. Generalization
bounds are then established for the difference in sup norm between the MV curve
of the empirical scoring function thus obtained and the optimal MV curve
Redistribution Through Public Employment: The Case of Italy
This paper examines the regional distribution of public employment in Italy. It documents two facts. The first is that public employment is used as a subsidy from the North to the less wealthy South. About half of the wage bill in the South of Italy can be identified as a subsidy. Both the size of public employment and the level of wages are used as a redistributive device. The second fact concerns the effects of subsidized public employment on individuals' attitudes toward job search, education, "risk taking" activities, and so on. Public employment discourages the development of market activities in the South. Copyright 2002, International Monetary Fund
CATMAID: collaborative annotation toolkit for massive amounts of image data
Summary: High-resolution, three-dimensional (3D) imaging of large biological specimens generates massive image datasets that are difficult to navigate, annotate and share effectively. Inspired by online mapping applications like GoogleMapsâ„¢, we developed a decentralized web interface that allows seamless navigation of arbitrarily large image stacks. Our interface provides means for online, collaborative annotation of the biological image data and seamless sharing of regions of interest by bookmarking. The CATMAID interface enables synchronized navigation through multiple registered datasets even at vastly different scales such as in comparisons between optical and electron microscopy
The interface of gravity and quantum mechanics illuminated by Wigner phase space
We provide an introduction into the formulation of non-relativistic quantum
mechanics using the Wigner phase-space distribution function and apply this
concept to two physical situations at the interface of quantum theory and
general relativity: (i) the motion of an ensemble of cold atoms relevant to
tests of the weak equivalence principle, and (ii) the Kasevich-Chu
interferometer. In order to lay the foundations for this analysis we first
present a representation-free description of the Kasevich-Chu interferometer
based on unitary operators.Comment: 69 pages, 6 figures, minor changes to match the published version.
The original publication is available at
http://en.sif.it/books/series/proceedings_fermi or
http://ebooks.iospress.nl/volumearticle/3809
On Zero-One and Convergence Laws for Graphs Embeddable on a Fixed Surface
We show that for no surface except for the plane does monadic second-order logic (MSO) have a zero-one-law - and not even a convergence law - on the class of (connected) graphs embeddable on the surface. In addition we show that every rational in [0,1] is the limiting probability of some MSO formula. This strongly refutes a conjecture by Heinig et al. (2014) who proved a convergence law for planar graphs, and a zero-one law for connected planar graphs, and also identified the so-called gaps of [0,1]: the subintervals that are not limiting probabilities of any MSO formula. The proof relies on a combination of methods from structural graph theory, especially large face-width embeddings of graphs on surfaces, analytic combinatorics, and finite model theory, and several parts of the proof may be of independent interest. In particular, we identify precisely the properties that make the zero-one law work on planar graphs but fail for every other surface
Automated Design of a Correction Dipole Magnet for LHC
A correction dipole magnet, with a horizontal dipole nested inside a vertical dipole has been designed and optimized linking together different electromagnetic software and CAD/CAM systems. The necessary interfaces have recently been established in the program ROXIE which has been developed at CERN for the automatic generation and optimization of superconducting coil geometries. The program provides, in addition to a mathematical optimization chest, interfaces to commercial electromagnetic and structural software packages, CAD/CAM and databases. The results from electromagnetic calculations with different programs have been compared. Some modelling considerations to reduce the computation time are also given
The Single Row Routing Problem Revisited: A Solution Based on Genetic Algorithms
With the advent of VLSI technology, circuits with more than one million transistors have been integrated onto a single chip. As the complexity of ICs grows, the time and money spent on designing the circuits become more important. A large, often dominant, part of the cost and time required to design an IC is consumed in the routing operation. The routing of carriers, such as in IC chips and printed circuit boards, is a classical problem in Computer Aided Design. With the complexity inherent in VLSI circuits, high performance routers are necessary. In this paper, a crucial step in the channel routing technique, the single row routing (SRR) problem, is considered. First, we discuss the relevance of SRR in the context of the general routing problem. Secondly, we show that heuristic algorithms are far from solving the general problem. Next, we introduce evolutionary computation, and, in particular, genetic algorithms (GAs) as a justifiable method in solving the SRR problem. Finally, an efficient O(nk) complexity technique based on GAs heuristic is obtained to solve the general SRR problem containing n nodes. Experimental results show that the algorithm is faster and can often generate better results than many of the leading heuristics proposed in the literature
Genome-wide DNA polymorphism analyses using VariScan
BACKGROUND: DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. RESULTS: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. CONCLUSION: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data
- …