6,118 research outputs found

    Music Similarity Estimation

    Get PDF
    Music is a complicated form of communication, where creators and culture communicate and expose their individuality. After music digitalization took place, recommendation systems and other online services have become indispensable in the field of Music Information Retrieval (MIR). To build these systems and recommend the right choice of song to the user, classification of songs is required. In this paper, we propose an approach for finding similarity between music based on mid-level attributes like pitch, midi value corresponding to pitch, interval, contour and duration and applying text based classification techniques. Our system predicts jazz, metal and ragtime for western music. The experiment to predict the genre of music is conducted based on 450 music files and maximum accuracy achieved is 95.8% across different n-grams. We have also analyzed the Indian classical Carnatic music and are classifying them based on its raga. Our system predicts Sankarabharam, Mohanam and Sindhubhairavi ragas. The experiment to predict the raga of the song is conducted based on 95 music files and the maximum accuracy achieved is 90.3% across different n-grams. Performance evaluation is done by using the accuracy score of scikit-learn

    Contributions to statistical machine learning algorithm

    Get PDF
    This thesis's research focus is on computational statistics along with DEAR (abbreviation of differential equation associated regression) model direction, and that in mind, the journal papers are written as contributions to statistical machine learning algorithm literature

    Calibrating the Na\"ive Cornell Model with NRQCD

    Full text link
    Along the years, the Cornell Model has been extraordinarily successful in describing hadronic phenomenology, in particular in physical situations for which an effective theory of the strong interactions such as NRQCD cannot be applied. As a consequence of its achievements, a relevant question is whether its model parameters can somehow be related to fundamental constants of QCD. We shall give a first answer in this article by comparing the predictions of both approaches. Building on results from a previous study on heavy meson spectroscopy, we calibrate the Cornell model employing NRQCD predictions for the lowest-lying bottomonium states up to N3^3LO, in which the bottom mass is varied within a wide range. We find that the Cornell model mass parameter can be identified, within perturbative uncertainties, with the MSR mass at the scale R=1 R = 1\,GeV. This identification holds for any value of αs\alpha_s or the bottom mass, and for all perturbative orders investigated. Furthermore, we show that: a) the "string tension" parameter is independent of the bottom mass, and b) the Coulomb strength κ\kappa of the Cornell model can be related to the QCD strong coupling constant αs\alpha_s at a characteristic non-relativistic scale. We also show how to remove the u=1/2u=1/2 renormalon of the static QCD potential and sum-up large logs related to the renormalon subtraction by switching to the low-scale, short-distance MSR mass, and using R-evolution. Our R-improved expression for the static potential remains independent of the heavy quark mass value and agrees with lattice QCD results for values of the radius as large as 0.8 0.8\,fm, and with the Cornell model potential at long distances. Finally we show that for moderate values of rr, the R-improved NRQCD and Cornell static potentials are in head-on agreement.Comment: 22 pages, 13 figures, 3 table

    Exceptional thermodynamics: The equation of state of G(2) gauge theory

    Get PDF
    We present a lattice study of the equation of state in Yang-Mills theory based on the exceptional G(2) gauge group. As is well-known, at zero temperature this theory shares many qualitative features with real-world QCD, including the absence of colored states in the spectrum and dynamical string breaking at large distances. In agreement with previous works, we show that at finite temperature this theory features a first-order deconfining phase transition, whose nature can be studied by a semi-classical computation. We also show that the equilibrium thermodynamic observables in the deconfined phase bear striking quantitative similarities with those found in SU(N) gauge theories: in particular, these quantities exhibit nearly perfect proportionality to the number of gluon degrees of freedom, and the trace anomaly reveals a characteristic quadratic dependence on the temperature, also observed in SU(N) Yang-Mills theories (both in four and in three spacetime dimensions). We compare our lattice data with analytical predictions from effective models, and discuss their implications for the deconfinement mechanism and high-temperature properties of strongly interacting, non-supersymmetric gauge theories. Our results give strong evidence for the conjecture that the thermal deconfining transition is governed by a universal mechanism, common to all simple gauge groups.Comment: 1+36 pages, 8 figures; v2, 1+41 pages, 9 figures: scale setting improved, discussion in section 1 slightly expanded, comments on the Monte Carlo algorithm added, new references included, affiliation details for one of the authors updated, minor misprints corrected: version published in the journa

    HoloDetect: Few-Shot Learning for Error Detection

    Full text link
    We introduce a few-shot learning framework for error detection. We show that data augmentation (a form of weak supervision) is key to training high-quality, ML-based error detection models that require minimal human involvement. Our framework consists of two parts: (1) an expressive model to learn rich representations that capture the inherent syntactic and semantic heterogeneity of errors; and (2) a data augmentation model that, given a small seed of clean records, uses dataset-specific transformations to automatically generate additional training data. Our key insight is to learn data augmentation policies from the noisy input dataset in a weakly supervised manner. We show that our framework detects errors with an average precision of ~94% and an average recall of ~93% across a diverse array of datasets that exhibit different types and amounts of errors. We compare our approach to a comprehensive collection of error detection methods, ranging from traditional rule-based methods to ensemble-based and active learning approaches. We show that data augmentation yields an average improvement of 20 F1 points while it requires access to 3x fewer labeled examples compared to other ML approaches.Comment: 18 pages

    NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Figure 1

    Get PDF
    Figure 1 demonstrates a series of training experiments with the Naïve Bayes classifier using different neighborhoods for contextual features, different sizes of positive and negative training examples and evaluated the resulting classifiers with our annotated gold standard corpus. The data sets are the results of running NetiNeti on subset of 136 PubMedCentral tagged open access articles and with no stop list.A scientific name for an organism can be associated with almost all biological data. Name identification is an important step in many text mining tasks aiming to extract useful information from biological, biomedical and biodiversity text sources. A scientific name acts as an important metadata element to link biological information.We present NetiNeti, a machine learning based approach for identification and discovery of scientific names. The system implementing the approach can be accessed at http://namefinding.ubio.org we present the comparison results of various machine learning algorithms on our annotated corpus. Naïve Bayes and Maximum Entropy with Generalized Iterative Scaling (GIS) parameter estimation are the top two performing algorithms
    • …
    corecore