62,114 research outputs found

    A Simple Correlation-Based Model of Intelligibility for Nonlinear Speech Enhancement and Separation

    Get PDF
    Applying a binary mask to a pure noise signal can result in speech that is highly intelligible, despite the absence of any of the target speech signal. Therefore, to estimate the intelligibility benefit of highly nonlinear speech enhancement techniques, we contend that SNR is not useful; instead we propose a measure based on the similarity between the time-varying spectral envelopes of target speech and system output, as measured by correlation. As with previous correlation-based intelligibility measures, our system can broadly match subjective intelligibility for a range of enhanced signals. Our system, however, is notably simpler and we explain the practical motivation behind each stage. This measure, freely available as a small Matlab implementation, can provide a more meaningful evaluation measure for nonlinear speech enhancement systems, as well as providing a transparent objective function for the optimization of such systems

    Twin Learning for Similarity and Clustering: A Unified Kernel Approach

    Full text link
    Many similarity-based clustering methods work in two separate steps including similarity matrix computation and subsequent spectral clustering. However, similarity measurement is challenging because it is usually impacted by many factors, e.g., the choice of similarity metric, neighborhood size, scale of data, noise and outliers. Thus the learned similarity matrix is often not suitable, let alone optimal, for the subsequent clustering. In addition, nonlinear similarity often exists in many real world data which, however, has not been effectively considered by most existing methods. To tackle these two challenges, we propose a model to simultaneously learn cluster indicator matrix and similarity information in kernel spaces in a principled way. We show theoretical relationships to kernel k-means, k-means, and spectral clustering methods. Then, to address the practical issue of how to select the most suitable kernel for a particular clustering task, we further extend our model with a multiple kernel learning ability. With this joint model, we can automatically accomplish three subtasks of finding the best cluster indicator matrix, the most accurate similarity relations and the optimal combination of multiple kernels. By leveraging the interactions between these three subtasks in a joint framework, each subtask can be iteratively boosted by using the results of the others towards an overall optimal solution. Extensive experiments are performed to demonstrate the effectiveness of our method.Comment: Published in AAAI 201

    A data-driven functional projection approach for the selection of feature ranges in spectra with ICA or cluster analysis

    Get PDF
    Prediction problems from spectra are largely encountered in chemometry. In addition to accurate predictions, it is often needed to extract information about which wavelengths in the spectra contribute in an effective way to the quality of the prediction. This implies to select wavelengths (or wavelength intervals), a problem associated to variable selection. In this paper, it is shown how this problem may be tackled in the specific case of smooth (for example infrared) spectra. The functional character of the spectra (their smoothness) is taken into account through a functional variable projection procedure. Contrarily to standard approaches, the projection is performed on a basis that is driven by the spectra themselves, in order to best fit their characteristics. The methodology is illustrated by two examples of functional projection, using Independent Component Analysis and functional variable clustering, respectively. The performances on two standard infrared spectra benchmarks are illustrated.Comment: A paraitr

    Comment of Global dynamics of biological systems

    Full text link
    In a recent study, (Grigorov, 2006) analyzed temporal gene expression profiles (Arbeitman et al., 2002) generated in a Drosophila experiment using SSA in conjunction with Monte-Carlo SSA. The author (Grigorov, 2006) makes three important claims in his article, namely: Claim1: A new method based on the theory of nonlinear time series analysis is used to capture the global dynamics of the fruit-fly cycle temporal gene expression profiles. Claim 2: Flattening of a significant part of the eigen-spectrum confirms the hypothesis about an underly-ing high-dimensional chaotic generating process. Claim 3: Monte-Carlo SSA can be used to establish whether a given time series is distinguishable from any well-defined process including deterministic chaos. In this report we present fundamental concerns with respect to the above claims (Grigorov, 2006) in a systematic manner with simple examples. The discussion provided especially discourages the choice of SSA for inferring nonlinear dynamical structure form time series obtained in any biological paradigm.Comment: 6 pages, 2 figure
    • …
    corecore