853 research outputs found
Ensemble of Example-Dependent Cost-Sensitive Decision Trees
Several real-world classification problems are example-dependent
cost-sensitive in nature, where the costs due to misclassification vary between
examples and not only within classes. However, standard classification methods
do not take these costs into account, and assume a constant cost of
misclassification errors. In previous works, some methods that take into
account the financial costs into the training of different algorithms have been
proposed, with the example-dependent cost-sensitive decision tree algorithm
being the one that gives the highest savings. In this paper we propose a new
framework of ensembles of example-dependent cost-sensitive decision-trees. The
framework consists in creating different example-dependent cost-sensitive
decision trees on random subsamples of the training set, and then combining
them using three different combination approaches. Moreover, we propose two new
cost-sensitive combination approaches; cost-sensitive weighted voting and
cost-sensitive stacking, the latter being based on the cost-sensitive logistic
regression method. Finally, using five different databases, from four
real-world applications: credit card fraud detection, churn modeling, credit
scoring and direct marketing, we evaluate the proposed method against
state-of-the-art example-dependent cost-sensitive techniques, namely,
cost-proportionate sampling, Bayes minimum risk and cost-sensitive decision
trees. The results show that the proposed algorithms have better results for
all databases, in the sense of higher savings.Comment: 13 pages, 6 figures, Submitted for possible publicatio
The Sub-Surface Structure of a Large Sample of Active Regions
We employ ring-diagram analysis to study the sub-surface thermal structure of
active regions. We present results using a large number of active regions over
the course of Solar Cycle 23. We present both traditional inversions of
ring-diagram frequency differences, with a total sample size of 264, and a
statistical study using Principal Component Analysis. We confirm earlier
results on smaller samples that sound speed and adiabatic index are changed
below regions of strong magnetic field. We find that sound speed is decreased
in the region between approximately r=0.99R_sun and r=0.995R_sun (depths of 3Mm
to 7Mm), and increased in the region between r=0.97R_sun and r=0.985R_sun
(depths of 11Mm to 21Mm). The adiabatic index is enhanced in the same deeper
layers that sound-speed enhancement is seen. A weak decrease in adiabatic index
is seen in the shallower layers in many active regions. We find that the
magnitudes of these perturbations depend on the strength of the surface
magnetic field, but we find a great deal of scatter in this relation, implying
other factors may be relevant.Comment: 16 pages, 11 figures, accepted for publication in Solar Physic
A survey of cost-sensitive decision tree induction algorithms
The past decade has seen a significant interest on the problem of inducing decision trees that take account of costs of misclassification and costs of acquiring the features used for decision making. This survey identifies over 50 algorithms including approaches that are direct adaptations of accuracy based methods, use genetic algorithms, use anytime methods and utilize boosting and bagging. The survey brings together these different studies and novel approaches to cost-sensitive decision tree learning, provides a useful taxonomy, a historical timeline of how the field has developed and should provide a useful reference point for future research in this field
Mitochondrial reactive oxygen species are scavenged by Cockayne syndrome B protein in human fibroblasts without nuclear DNA damage
Cockayne syndrome (CS) is a human DNA repair-deficient disease that involves transcription coupled repair (TCR), in which three gene products, Cockayne syndrome A (CSA), Cockayne syndrome B (CSB), and ultraviolet stimulated scaffold protein A (UVSSA) cooperate in relieving RNA polymerase II arrest at damaged sites to permit repair of the template strand. Mutation of any of these three genes results in cells with increased sensitivity to UV light and defective TCR. Mutations in CSA or CSB are associated with severe neurological disease but mutations in UVSSA are for the most part only associated with increased photosensitivity. This difference raises questions about the relevance of TCR to neurological disease in CS. We find that CSB-mutated cells, but not UVSSA-deficient cells, have increased levels of intramitochondrial reactive oxygen species (ROS), especially when mitochondrial complex I is inhibited by rotenone. Increased ROS would result in oxidative damage to mitochondrial proteins, lipids, and DNA. CSB appears to behave as an electron scavenger in the mitochondria whose absence leads to increased oxidative stress. Mitochondrial ROS, however, did not cause detectable nuclear DNA damage even when base excision repair was blocked by an inhibitor of polyADP ribose polymerase. Neurodegeneration in Cockayne syndrome may therefore be associated with ROS-induced damage in the mitochondria, independent of nuclear TCR. An implication of our present results is that mitochondrial dysfunction involving ROS has a major impact on CS-B pathology, whereas nuclear TCR may have a minimal role
Climatic applicability of downdraught evaporative cooling in the United States of America
The potential for application of downdraught cooling in the United States of America (U.S.) depends on its climatic characteristics. However, due to the large geographic span of the country, it varies due to differences in latitude, and a range of geographic features influencing climate, including altitude, topography and terrain. This study describes the development of climatic applicability maps of downdraught cooling in the U.S., which can aid designers in the initial identification of the correct cooling strategy for the geographic area of interest. The proposed approach is based on a set of maps, which are derived from two related climatic indexes: dry bulb temperature to wet bulb temperature depression (DBTâWBT), representing the climatic opportunity, and 26ÂșC minus wet bulb temperature (26ÂșCâWBT), representing the climatic opportunity against the theoretical cooling requirement for each location. The downdraught cooling strategy and degree of applicability is classified in the map, based on the aforementioned climatic and cooling parameters. Finally, four representative buildings in four different regions with different climatic conditions were selected for climatic analysis. This resulted in the identification of some climate zones for downdraught cooling application in the U.S. and the suggestion of appropriate design strategies for each of them
Cosmology and Astrophysics from Relaxed Galaxy Clusters I: Sample Selection
This is the first in a series of papers studying the astrophysics and
cosmology of massive, dynamically relaxed galaxy clusters. Here we present a
new, automated method for identifying relaxed clusters based on their
morphologies in X-ray imaging data. While broadly similar to others in the
literature, the morphological quantities that we measure are specifically
designed to provide a fair basis for comparison across a range of data quality
and cluster redshifts, to be robust against missing data due to point-source
masks and gaps between detectors, and to avoid strong assumptions about the
cosmological background and cluster masses. Based on three morphological
indicators - Symmetry, Peakiness and Alignment - we develop the SPA criterion
for relaxation. This analysis was applied to a large sample of cluster
observations from the Chandra and ROSAT archives. Of the 361 clusters which
received the SPA treatment, 57 (16 per cent) were subsequently found to be
relaxed according to our criterion. We compare our measurements to similar
estimators in the literature, as well as projected ellipticity and other image
measures, and comment on trends in the relaxed cluster fraction with redshift,
temperature, and survey selection method. Code implementing our morphological
analysis will be made available on the web.Comment: MNRAS, in press. 43 pages in total, of which 17 are tables (please
think twice before printing). 18 figures, 4 tables. Machine-readable tables
will be available from the journal and at the url below; code will be posted
at http://www.slac.stanford.edu/~amantz/work/morph14
From Simple to Complex and Ultra-complex Systems:\ud A Paradigm Shift Towards Non-Abelian Systems Dynamics
Atoms, molecules, organisms distinguish layers of reality because of the causal links that govern their behavior, both horizontally (atom-atom, molecule-molecule, organism-organism) and vertically (atom-molecule-organism). This is the first intuition of the theory of levels. Even if the further development of the theory will require imposing a number of qualifications to this initial intuition, the idea of a series of entities organized on different levels of complexity will prove correct. Living systems as well as social systems and the human mind present features remarkably different from those characterizing non-living, simple physical and chemical systems. We propose that super-complexity requires at least four different categorical frameworks, provided by the theories of levels of reality, chronotopoids, (generalized) interactions, and anticipation
From Simple to Complex and Ultra-complex Systems:\ud A Paradigm Shift Towards Non-Abelian Systems Dynamics
Atoms, molecules, organisms distinguish layers of reality because of the causal links that govern their behavior, both horizontally (atom-atom, molecule-molecule, organism-organism) and vertically (atom-molecule-organism). This is the first intuition of the theory of levels. Even if the further development of the theory will require imposing a number of qualifications to this initial intuition, the idea of a series of entities organized on different levels of complexity will prove correct. Living systems as well as social systems and the human mind present features remarkably different from those characterizing non-living, simple physical and chemical systems. We propose that super-complexity requires at least four different categorical frameworks, provided by the theories of levels of reality, chronotopoids, (generalized) interactions, and anticipation
Correlations in the (Sub)millimeter Background from ACT Ă BLAST
We present measurements of the auto- and cross-frequency correlation power spectra of the cosmic (sub)millimeter background at 250, 350, and 500 ÎŒm (1200, 860, and 600 GHz) from observations made with the Balloon-borne Large Aperture Submillimeter Telescope (BLAST); and at 1380 and 2030 ÎŒm (218 and 148 GHz) from observations made with the Atacama Cosmology Telescope (ACT). The overlapping observations cover 8.6 deg^2 in an area relatively free of Galactic dust near the south ecliptic pole. The ACT bands are sensitive to radiation from the cosmic microwave background, to the Sunyaev-Zel'dovich effect from galaxy clusters, and to emission by radio and dusty star-forming galaxies (DSFGs), while the dominant contribution to the BLAST bands is from DSFGs. We confirm and extend the BLAST analysis of clustering with an independent pipeline and also detect correlations between the ACT and BLAST maps at over 25Ï significance, which we interpret as a detection of the DSFGs in the ACT maps. In addition to a Poisson component in the cross-frequency power spectra, we detect a clustered signal at 4Ï, and using a model for the DSFG evolution and number counts, we successfully fit all of our spectra with a linear clustering model and a bias that depends only on redshift and not on scale. Finally, the data are compared to, and generally agree with, phenomenological models for the DSFG population. This study demonstrates the constraining power of the cross-frequency correlation technique to constrain models for the DSFGs. Similar analyses with more data will impose tight constraints on future models
- âŠ