213,091 research outputs found
The structure of a logarithmic advice class
The complexity class Full-P / log, corresponding to a form of logarithmic
advice for polynomial time, is studied.
In order to understand the inner structure of this class, we characterize
Full-P /log in terms of Turing reducibility to a special family of
sparse sets. Other characterizations of Full-P / log, relating it to sets
with small information content, were already known. These used tally
sets whose words follow a given regular pattern and tally sets that are
regular in a resource-bounded Kolmogorov complexity sense.
We obtain here relationships between the equivalence classes of
the mentioned tally and sparse sets under various reducibiities, which
provide new knowledge about the logarithmic advice class.
Another way to measure the amount of information encoded in a
language in a nonuniform class, is to study the relative complexity of
computing advice functions for this language. We prove bounds on the
complexity of ad vice functions for Full-P / log and for other subclasses
of it. As a consequence, Full-P / log is located in the Extended Low
Hierarchy
Rectilinear Steiner Trees in Narrow Strips
A rectilinear Steiner tree for a set of points in is a
tree that connects the points in using horizontal and vertical line
segments. The goal of Minimal Rectilinear Steiner Tree is to find a rectilinear
Steiner tree with minimal total length. We investigate how the complexity of
Minimal Rectilinear Steiner Tree for point sets inside the strip
depends on the strip width . We
obtain two main results. 1) We present an algorithm with running time
for sparse point sets, that is, point sets where each
rectangle inside the strip contains points. 2) For
random point sets, where the points are chosen randomly inside a rectangle of
height and expected width , we present an algorithm that is
fixed-parameter tractable with respect to and linear in . It has an
expected running time of .Comment: 21 pages, 13 figure
Baire categories on small complexity classes and meager–comeager laws
We introduce two resource-bounded Baire category notions on small complexity classes such as P, QUASIPOLY, SUBEXP and PSPACE and on probabilistic classes such as BPP, which differ on how the corresponding finite extension strategies are computed. We give an alternative characterization of small sets via resource-bounded Banach-Mazur games. As an application of the first notion, we show that for almost every language A (i.e. all except a meager class) computable in subexponential time, PA = BPPA. We also show that almost all languages in PSPACE do not have small nonuniform complexity. We then switch to the second Baire category notion (called locally-computable), and show that the class SPARSE is meager in P. We show that in contrast to the resource-bounded measure case, meager–comeager laws can be obtained for many standard complexity classes, relative to locally-computable Baire category on BPP and PSPACE. Another topic where locally-computable Baire categories differ from resource-bounded measure is regarding weak-completeness: we show that there is no weak-completeness notion in P based on locally-computable Baire categories, i.e. every P-weakly-complete set is complete for P. We also prove that the class of complete sets for P under Turing-logspace reductions is meager in P, if P is not equal to DSPACE (log n), and that the same holds unconditionally for QUASIPOLY. Finally we observe that locally-computable Baire categories are incomparable with all existing resource-bounded measure notions on small complexity classes, which might explain why those two settings seem to differ so fundamentally
strip
We investigate how the complexity of Euclidean TSP for point sets inside the strip depends on the strip width . We obtain two main results. First, for the case where the points have distinct integer -coordinates, we prove that a shortest bitonic tour (which can be computed in time using an existing algorithm) is guaranteed to be a shortest tour overall when , a bound which is best possible. Second, we present an algorithm that is fixed-parameter tractable with respect to . More precisely, our algorithm has running time for sparse point sets, where each rectangle inside the strip contains points. For random point sets, where the points are chosen uniformly at random from the rectangle~, it has an expected running time of
Elastic SCAD as a novel penalization method for SVM classification tasks in high-dimensional data
<p>Abstract</p> <p>Background</p> <p>Classification and variable selection play an important role in knowledge discovery in high-dimensional data. Although Support Vector Machine (SVM) algorithms are among the most powerful classification and prediction methods with a wide range of scientific applications, the SVM does not include automatic feature selection and therefore a number of feature selection procedures have been developed. Regularisation approaches extend SVM to a feature selection method in a flexible way using penalty functions like LASSO, SCAD and Elastic Net.</p> <p>We propose a novel penalty function for SVM classification tasks, Elastic SCAD, a combination of SCAD and ridge penalties which overcomes the limitations of each penalty alone.</p> <p>Since SVM models are extremely sensitive to the choice of tuning parameters, we adopted an interval search algorithm, which in comparison to a fixed grid search finds rapidly and more precisely a global optimal solution.</p> <p>Results</p> <p>Feature selection methods with combined penalties (Elastic Net and Elastic SCAD SVMs) are more robust to a change of the model complexity than methods using single penalties. Our simulation study showed that Elastic SCAD SVM outperformed LASSO (<it>L</it><sub>1</sub>) and SCAD SVMs. Moreover, Elastic SCAD SVM provided sparser classifiers in terms of median number of features selected than Elastic Net SVM and often better predicted than Elastic Net in terms of misclassification error.</p> <p>Finally, we applied the penalization methods described above on four publicly available breast cancer data sets. Elastic SCAD SVM was the only method providing robust classifiers in sparse and non-sparse situations.</p> <p>Conclusions</p> <p>The proposed Elastic SCAD SVM algorithm provides the advantages of the SCAD penalty and at the same time avoids sparsity limitations for non-sparse data. We were first to demonstrate that the integration of the interval search algorithm and penalized SVM classification techniques provides fast solutions on the optimization of tuning parameters.</p> <p>The penalized SVM classification algorithms as well as fixed grid and interval search for finding appropriate tuning parameters were implemented in our freely available R package 'penalizedSVM'.</p> <p>We conclude that the Elastic SCAD SVM is a flexible and robust tool for classification and feature selection tasks for high-dimensional data such as microarray data sets.</p
Fixed-point tile sets and their applications
v4: added references to a paper by Nicolas Ollinger and several historical commentsAn aperiodic tile set was first constructed by R. Berger while proving the undecidability of the domino problem. It turned out that aperiodic tile sets appear in many topics ranging from logic (the Entscheidungsproblem) to physics (quasicrystals). We present a new construction of an aperiodic tile set that is based on Kleene's fixed-point construction instead of geometric arguments. This construction is similar to J. von Neumann self-reproducing automata; similar ideas were also used by P. Gacs in the context of error-correcting computations. This construction it rather flexible, so it can be used in many ways: we show how it can be used to implement substitution rules, to construct strongly aperiodic tile sets (any tiling is far from any periodic tiling), to give a new proof for the undecidability of the domino problem and related results, characterize effectively closed 1D subshift it terms of 2D shifts of finite type (improvement of a result by M. Hochman), to construct a tile set which has only complex tilings, and to construct a "robust" aperiodic tile set that does not have periodic (or close to periodic) tilings even if we allow some (sparse enough) tiling errors. For the latter we develop a hierarchical classification of points in random sets into islands of different ranks. Finally, we combine and modify our tools to prove our main result: there exists a tile set such that all tilings have high Kolmogorov complexity even if (sparse enough) tiling errors are allowed
- …