6,118 research outputs found
Music Similarity Estimation
Music is a complicated form of communication, where creators and culture communicate and expose their individuality. After music digitalization took place, recommendation systems and other online services have become indispensable in the field of Music Information Retrieval (MIR). To build these systems and recommend the right choice of song to the user, classification of songs is required. In this paper, we propose an approach for finding similarity between music based on mid-level attributes like pitch, midi value corresponding to pitch, interval, contour and duration and applying text based classification techniques. Our system predicts jazz, metal and ragtime for western music. The experiment to predict the genre of music is conducted based on 450 music files and maximum accuracy achieved is 95.8% across different n-grams. We have also analyzed the Indian classical Carnatic music and are classifying them based on its raga. Our system predicts Sankarabharam, Mohanam and Sindhubhairavi ragas. The experiment to predict the raga of the song is conducted based on 95 music files and the maximum accuracy achieved is 90.3% across different n-grams. Performance evaluation is done by using the accuracy score of scikit-learn
Contributions to statistical machine learning algorithm
This thesis's research focus is on computational statistics along with DEAR (abbreviation of differential equation associated regression) model direction, and that in mind, the journal papers are written as contributions to statistical machine learning algorithm literature
Calibrating the Na\"ive Cornell Model with NRQCD
Along the years, the Cornell Model has been extraordinarily successful in
describing hadronic phenomenology, in particular in physical situations for
which an effective theory of the strong interactions such as NRQCD cannot be
applied. As a consequence of its achievements, a relevant question is whether
its model parameters can somehow be related to fundamental constants of QCD. We
shall give a first answer in this article by comparing the predictions of both
approaches. Building on results from a previous study on heavy meson
spectroscopy, we calibrate the Cornell model employing NRQCD predictions for
the lowest-lying bottomonium states up to NLO, in which the bottom mass is
varied within a wide range. We find that the Cornell model mass parameter can
be identified, within perturbative uncertainties, with the MSR mass at the
scale GeV. This identification holds for any value of or
the bottom mass, and for all perturbative orders investigated. Furthermore, we
show that: a) the "string tension" parameter is independent of the bottom mass,
and b) the Coulomb strength of the Cornell model can be related to the
QCD strong coupling constant at a characteristic non-relativistic
scale. We also show how to remove the renormalon of the static QCD
potential and sum-up large logs related to the renormalon subtraction by
switching to the low-scale, short-distance MSR mass, and using R-evolution. Our
R-improved expression for the static potential remains independent of the heavy
quark mass value and agrees with lattice QCD results for values of the radius
as large as fm, and with the Cornell model potential at long distances.
Finally we show that for moderate values of , the R-improved NRQCD and
Cornell static potentials are in head-on agreement.Comment: 22 pages, 13 figures, 3 table
Exceptional thermodynamics: The equation of state of G(2) gauge theory
We present a lattice study of the equation of state in Yang-Mills theory
based on the exceptional G(2) gauge group. As is well-known, at zero
temperature this theory shares many qualitative features with real-world QCD,
including the absence of colored states in the spectrum and dynamical string
breaking at large distances. In agreement with previous works, we show that at
finite temperature this theory features a first-order deconfining phase
transition, whose nature can be studied by a semi-classical computation. We
also show that the equilibrium thermodynamic observables in the deconfined
phase bear striking quantitative similarities with those found in SU(N) gauge
theories: in particular, these quantities exhibit nearly perfect
proportionality to the number of gluon degrees of freedom, and the trace
anomaly reveals a characteristic quadratic dependence on the temperature, also
observed in SU(N) Yang-Mills theories (both in four and in three spacetime
dimensions). We compare our lattice data with analytical predictions from
effective models, and discuss their implications for the deconfinement
mechanism and high-temperature properties of strongly interacting,
non-supersymmetric gauge theories. Our results give strong evidence for the
conjecture that the thermal deconfining transition is governed by a universal
mechanism, common to all simple gauge groups.Comment: 1+36 pages, 8 figures; v2, 1+41 pages, 9 figures: scale setting
improved, discussion in section 1 slightly expanded, comments on the Monte
Carlo algorithm added, new references included, affiliation details for one
of the authors updated, minor misprints corrected: version published in the
journa
HoloDetect: Few-Shot Learning for Error Detection
We introduce a few-shot learning framework for error detection. We show that
data augmentation (a form of weak supervision) is key to training high-quality,
ML-based error detection models that require minimal human involvement. Our
framework consists of two parts: (1) an expressive model to learn rich
representations that capture the inherent syntactic and semantic heterogeneity
of errors; and (2) a data augmentation model that, given a small seed of clean
records, uses dataset-specific transformations to automatically generate
additional training data. Our key insight is to learn data augmentation
policies from the noisy input dataset in a weakly supervised manner. We show
that our framework detects errors with an average precision of ~94% and an
average recall of ~93% across a diverse array of datasets that exhibit
different types and amounts of errors. We compare our approach to a
comprehensive collection of error detection methods, ranging from traditional
rule-based methods to ensemble-based and active learning approaches. We show
that data augmentation yields an average improvement of 20 F1 points while it
requires access to 3x fewer labeled examples compared to other ML approaches.Comment: 18 pages
NetiNeti : Discovery of Scientific Names from Text Using Machine Learning Methods Figure 1
Figure 1 demonstrates a series of training experiments with the Naïve Bayes classifier using different neighborhoods for contextual features, different sizes of positive and
negative training examples and evaluated the resulting classifiers with our annotated
gold standard corpus.
The data sets are the results of running NetiNeti on subset of 136 PubMedCentral tagged open access articles and with no stop list.A scientific name for an organism can be associated with almost all biological data.
Name identification is an important step in many text mining tasks aiming to extract
useful information from biological, biomedical and biodiversity text sources. A
scientific name acts as an important metadata element to link biological information.We present NetiNeti, a machine learning based approach for identification and
discovery of scientific names. The system implementing the approach can be accessed
at http://namefinding.ubio.org we present the comparison results of various machine
learning algorithms on our annotated corpus. Naïve Bayes and Maximum Entropy
with Generalized Iterative Scaling (GIS) parameter estimation are the top two
performing algorithms
- …