4,115 research outputs found
QCBA: Postoptimization of Quantitative Attributes in Classifiers based on Association Rules
The need to prediscretize numeric attributes before they can be used in
association rule learning is a source of inefficiencies in the resulting
classifier. This paper describes several new rule tuning steps aiming to
recover information lost in the discretization of numeric (quantitative)
attributes, and a new rule pruning strategy, which further reduces the size of
the classification models. We demonstrate the effectiveness of the proposed
methods on postoptimization of models generated by three state-of-the-art
association rule classification algorithms: Classification based on
Associations (Liu, 1998), Interpretable Decision Sets (Lakkaraju et al, 2016),
and Scalable Bayesian Rule Lists (Yang, 2017). Benchmarks on 22 datasets from
the UCI repository show that the postoptimized models are consistently smaller
-- typically by about 50% -- and have better classification performance on most
datasets
Probing the fuzzy sphere regularisation in simulations of the 3d \lambda \phi^4 model
We regularise the 3d \lambda \phi^4 model by discretising the Euclidean time
and representing the spatial part on a fuzzy sphere. The latter involves a
truncated expansion of the field in spherical harmonics. This yields a
numerically tractable formulation, which constitutes an unconventional
alternative to the lattice. In contrast to the 2d version, the radius R plays
an independent r\^{o}le. We explore the phase diagram in terms of R and the
cutoff, as well as the parameters m^2 and \lambda. Thus we identify the phases
of disorder, uniform order and non-uniform order. We compare the result to the
phase diagrams of the 3d model on a non-commutative torus, and of the 2d model
on a fuzzy sphere. Our data at strong coupling reproduce accurately the
behaviour of a matrix chain, which corresponds to the c=1-model in string
theory. This observation enables a conjecture about the thermodynamic limit.Comment: 31 pages, 15 figure
On the role of pre and post-processing in environmental data mining
The quality of discovered knowledge is highly depending on data quality. Unfortunately real data use to contain noise, uncertainty, errors, redundancies or even irrelevant information. The more complex is the reality to be analyzed, the higher the risk of getting low quality data. Knowledge Discovery from Databases (KDD) offers a global framework to prepare data in the right form to perform correct analyses. On the other hand, the quality of decisions taken upon KDD results, depend not only on the quality of the results themselves, but on the capacity of the system to communicate those results in an understandable form. Environmental systems are particularly complex and environmental users particularly require clarity in their results. In this paper some details about how this can be achieved are provided. The role of the pre and post processing in the whole process of Knowledge Discovery in environmental systems is discussed
Medical imaging analysis with artificial neural networks
Given that neural networks have been widely reported in the research community of medical imaging, we provide a focused literature survey on recent neural network developments in computer-aided diagnosis, medical image segmentation and edge detection towards visual content analysis, and medical image registration for its pre-processing and post-processing, with the aims of increasing awareness of how neural networks can be applied to these areas and to provide a foundation for further research and practical development. Representative techniques and algorithms are explained in detail to provide inspiring examples illustrating: (i) how a known neural network with fixed structure and training procedure could be applied to resolve a medical imaging problem; (ii) how medical images could be analysed, processed, and characterised by neural networks; and (iii) how neural networks could be expanded further to resolve problems relevant to medical imaging. In the concluding section, a highlight of comparisons among many neural network applications is included to provide a global view on computational intelligence with neural networks in medical imaging
On the Inversion of High Energy Proton
Inversion of the K-fold stochastic autoconvolution integral equation is an
elementary nonlinear problem, yet there are no de facto methods to solve it
with finite statistics. To fix this problem, we introduce a novel inverse
algorithm based on a combination of minimization of relative entropy, the Fast
Fourier Transform and a recursive version of Efron's bootstrap. This gives us
power to obtain new perspectives on non-perturbative high energy QCD, such as
probing the ab initio principles underlying the approximately negative binomial
distributions of observed charged particle final state multiplicities, related
to multiparton interactions, the fluctuating structure and profile of proton
and diffraction. As a proof-of-concept, we apply the algorithm to ALICE
proton-proton charged particle multiplicity measurements done at different
center-of-mass energies and fiducial pseudorapidity intervals at the LHC,
available on HEPData. A strong double peak structure emerges from the
inversion, barely visible without it.Comment: 29 pages, 10 figures, v2: extended analysis (re-projection ratios,
2D
Eigenvalue-flipping Algorithm for Matrix Monte Carlo
Many physical systems can be described in terms of matrix models that we
often cannot solve analytically. Fortunately, they can be studied numerically
in a straightforward way. Many commonly used algorithms follow the Monte Carlo
method, which is efficient for small matrix sizes but cannot guarantee
ergodicity when working with large ones. In this paper, we propose an
improvement of the algorithm that, for a large class of matrix models, allows
to tunnel between various vacua in a proficient way, where sign change of
eigenvalues is proposed externally. We test the method on two models: the pure
potential matrix model and the scalar field theory on the fuzzy sphere.Comment: v2: minor corrections, references adde
Numerical studies of the critical behaviour of non-commutative field theories
We study the critical behaviour of matrix models with builtin
SU(2) geometry by using Hybrid Monte Carlo (HMC) techniques.
The first system under study is a matrix regularization of the
φ4 theory defined on the sphere. We develop a HMC algorithm
together with an SU(2) gauge-fixing procedure in order to study
the model. We extract the phase diagram of the model and give
an estimation for the triple point for a system constructed of
matrices of size N = 7. Our numerical results also suggest the
existence of stripe phases- phases in which modes with higher
momentum l have non-negligible contribution.
The second system under study is a matrix model realized via
competing Yang-Mills and Myers terms. In its low-temperature
phase the system has geometrical phase with SO(3) symmetry:
the ground state is represented by the su(2) generators. This
geometry disappears in the high-temperature phase the system.
Our results suggest that there are three main types of fluctuations
in the system close to the transition: fluctuations of the
fuzzy sphere, fluctuations which drive the system between the
two phases, and fluctuations of the high-temperature regime.
The fluctuations of the fuzzy sphere show the properties of a
second order phase transition. We establish the validity of the
finite size scaling ansatz in that regime. The fluctuations which
bring the system between the phases show the properties of a
first order transition.
In the Appendix we provide in some detail the idea behind
the HMC approach. We give some practical guidelines if one is to
implement such an algorithm to study matrix models. We comment
on the main sources for the phenomenon of autocorrelation
time. As a final topic we present the basics of the OpenCL language
which we used to port some of our algorithms for parallel
computing architectures such as GPU’s
- …