4,115 research outputs found

    QCBA: Postoptimization of Quantitative Attributes in Classifiers based on Association Rules

    Full text link
    The need to prediscretize numeric attributes before they can be used in association rule learning is a source of inefficiencies in the resulting classifier. This paper describes several new rule tuning steps aiming to recover information lost in the discretization of numeric (quantitative) attributes, and a new rule pruning strategy, which further reduces the size of the classification models. We demonstrate the effectiveness of the proposed methods on postoptimization of models generated by three state-of-the-art association rule classification algorithms: Classification based on Associations (Liu, 1998), Interpretable Decision Sets (Lakkaraju et al, 2016), and Scalable Bayesian Rule Lists (Yang, 2017). Benchmarks on 22 datasets from the UCI repository show that the postoptimized models are consistently smaller -- typically by about 50% -- and have better classification performance on most datasets

    Probing the fuzzy sphere regularisation in simulations of the 3d \lambda \phi^4 model

    Get PDF
    We regularise the 3d \lambda \phi^4 model by discretising the Euclidean time and representing the spatial part on a fuzzy sphere. The latter involves a truncated expansion of the field in spherical harmonics. This yields a numerically tractable formulation, which constitutes an unconventional alternative to the lattice. In contrast to the 2d version, the radius R plays an independent r\^{o}le. We explore the phase diagram in terms of R and the cutoff, as well as the parameters m^2 and \lambda. Thus we identify the phases of disorder, uniform order and non-uniform order. We compare the result to the phase diagrams of the 3d model on a non-commutative torus, and of the 2d model on a fuzzy sphere. Our data at strong coupling reproduce accurately the behaviour of a matrix chain, which corresponds to the c=1-model in string theory. This observation enables a conjecture about the thermodynamic limit.Comment: 31 pages, 15 figure

    On the role of pre and post-processing in environmental data mining

    Get PDF
    The quality of discovered knowledge is highly depending on data quality. Unfortunately real data use to contain noise, uncertainty, errors, redundancies or even irrelevant information. The more complex is the reality to be analyzed, the higher the risk of getting low quality data. Knowledge Discovery from Databases (KDD) offers a global framework to prepare data in the right form to perform correct analyses. On the other hand, the quality of decisions taken upon KDD results, depend not only on the quality of the results themselves, but on the capacity of the system to communicate those results in an understandable form. Environmental systems are particularly complex and environmental users particularly require clarity in their results. In this paper some details about how this can be achieved are provided. The role of the pre and post processing in the whole process of Knowledge Discovery in environmental systems is discussed

    Medical imaging analysis with artificial neural networks

    Get PDF
    Given that neural networks have been widely reported in the research community of medical imaging, we provide a focused literature survey on recent neural network developments in computer-aided diagnosis, medical image segmentation and edge detection towards visual content analysis, and medical image registration for its pre-processing and post-processing, with the aims of increasing awareness of how neural networks can be applied to these areas and to provide a foundation for further research and practical development. Representative techniques and algorithms are explained in detail to provide inspiring examples illustrating: (i) how a known neural network with fixed structure and training procedure could be applied to resolve a medical imaging problem; (ii) how medical images could be analysed, processed, and characterised by neural networks; and (iii) how neural networks could be expanded further to resolve problems relevant to medical imaging. In the concluding section, a highlight of comparisons among many neural network applications is included to provide a global view on computational intelligence with neural networks in medical imaging

    On the Inversion of High Energy Proton

    Full text link
    Inversion of the K-fold stochastic autoconvolution integral equation is an elementary nonlinear problem, yet there are no de facto methods to solve it with finite statistics. To fix this problem, we introduce a novel inverse algorithm based on a combination of minimization of relative entropy, the Fast Fourier Transform and a recursive version of Efron's bootstrap. This gives us power to obtain new perspectives on non-perturbative high energy QCD, such as probing the ab initio principles underlying the approximately negative binomial distributions of observed charged particle final state multiplicities, related to multiparton interactions, the fluctuating structure and profile of proton and diffraction. As a proof-of-concept, we apply the algorithm to ALICE proton-proton charged particle multiplicity measurements done at different center-of-mass energies and fiducial pseudorapidity intervals at the LHC, available on HEPData. A strong double peak structure emerges from the inversion, barely visible without it.Comment: 29 pages, 10 figures, v2: extended analysis (re-projection ratios, 2D

    Eigenvalue-flipping Algorithm for Matrix Monte Carlo

    Full text link
    Many physical systems can be described in terms of matrix models that we often cannot solve analytically. Fortunately, they can be studied numerically in a straightforward way. Many commonly used algorithms follow the Monte Carlo method, which is efficient for small matrix sizes but cannot guarantee ergodicity when working with large ones. In this paper, we propose an improvement of the algorithm that, for a large class of matrix models, allows to tunnel between various vacua in a proficient way, where sign change of eigenvalues is proposed externally. We test the method on two models: the pure potential matrix model and the scalar field theory on the fuzzy sphere.Comment: v2: minor corrections, references adde

    Numerical studies of the critical behaviour of non-commutative field theories

    Get PDF
    We study the critical behaviour of matrix models with builtin SU(2) geometry by using Hybrid Monte Carlo (HMC) techniques. The first system under study is a matrix regularization of the φ4 theory defined on the sphere. We develop a HMC algorithm together with an SU(2) gauge-fixing procedure in order to study the model. We extract the phase diagram of the model and give an estimation for the triple point for a system constructed of matrices of size N = 7. Our numerical results also suggest the existence of stripe phases- phases in which modes with higher momentum l have non-negligible contribution. The second system under study is a matrix model realized via competing Yang-Mills and Myers terms. In its low-temperature phase the system has geometrical phase with SO(3) symmetry: the ground state is represented by the su(2) generators. This geometry disappears in the high-temperature phase the system. Our results suggest that there are three main types of fluctuations in the system close to the transition: fluctuations of the fuzzy sphere, fluctuations which drive the system between the two phases, and fluctuations of the high-temperature regime. The fluctuations of the fuzzy sphere show the properties of a second order phase transition. We establish the validity of the finite size scaling ansatz in that regime. The fluctuations which bring the system between the phases show the properties of a first order transition. In the Appendix we provide in some detail the idea behind the HMC approach. We give some practical guidelines if one is to implement such an algorithm to study matrix models. We comment on the main sources for the phenomenon of autocorrelation time. As a final topic we present the basics of the OpenCL language which we used to port some of our algorithms for parallel computing architectures such as GPU’s
    corecore