31,325 research outputs found

    Singing voice correction using canonical time warping

    Full text link
    Expressive singing voice correction is an appealing but challenging problem. A robust time-warping algorithm which synchronizes two singing recordings can provide a promising solution. We thereby propose to address the problem by canonical time warping (CTW) which aligns amateur singing recordings to professional ones. A new pitch contour is generated given the alignment information, and a pitch-corrected singing is synthesized back through the vocoder. The objective evaluation shows that CTW is robust against pitch-shifting and time-stretching effects, and the subjective test demonstrates that CTW prevails the other methods including DTW and the commercial auto-tuning software. Finally, we demonstrate the applicability of the proposed method in a practical, real-world scenario

    A Phase Vocoder based on Nonstationary Gabor Frames

    Full text link
    We propose a new algorithm for time stretching music signals based on the theory of nonstationary Gabor frames (NSGFs). The algorithm extends the techniques of the classical phase vocoder (PV) by incorporating adaptive time-frequency (TF) representations and adaptive phase locking. The adaptive TF representations imply good time resolution for the onsets of attack transients and good frequency resolution for the sinusoidal components. We estimate the phase values only at peak channels and the remaining phases are then locked to the values of the peaks in an adaptive manner. During attack transients we keep the stretch factor equal to one and we propose a new strategy for determining which channels are relevant for reinitializing the corresponding phase values. In contrast to previously published algorithms we use a non-uniform NSGF to obtain a low redundancy of the corresponding TF representation. We show that with just three times as many TF coefficients as signal samples, artifacts such as phasiness and transient smearing can be greatly reduced compared to the classical PV. The proposed algorithm is tested on both synthetic and real world signals and compared with state of the art algorithms in a reproducible manner.Comment: 10 pages, 6 figure

    Methods of Improving Speech Intelligibility for Listeners with Hearing Resolution Deficit

    Get PDF
    ABSTRACT: Methods developed for real-time time scale modification (TSM) of speech signal are presented. They are based on the non-uniform, speech rate depended SOLA algorithm (Synchronous Overlap and Add). Influence of the proposed method on the intelligibility of speech was investigated for two separate groups of listeners, i.e. hearing impaired children and elderly listeners. It was shown that for the speech with average rate equal to or higher than 6.48 vowels/s, all of the proposed methods have statistically significant impact on the improvement of speech intelligibility for hearing impaired children with reduced hearing resolution and one of the proposed methods significantly improves comprehension of speech in the group of elderly listeners with reduced hearing resolution. VIRTUAL SLIDES: http://www.diagnosticpathology.diagnomx.eu/vs/206548637176199

    Dynamical variety of shapes in financial multifractality

    Full text link
    The concept of multifractality offers a powerful formal tool to filter out multitude of the most relevant characteristics of complex time series. The related studies thus far presented in the scientific literature typically limit themselves to evaluation of whether or not a time series is multifractal and width of the resulting singularity spectrum is considered a measure of the degree of complexity involved. However, the character of the complexity of time series generated by the natural processes usually appears much more intricate than such a bare statement can reflect. As an example, based on the long-term records of S&P500 and NASDAQ - the two world leading stock market indices - the present study shows that they indeed develop the multifractal features, but these features evolve through a variety of shapes, most often strongly asymmetric, whose changes typically are correlated with the historically most significant events experienced by the world economy. Relating at the same time the index multifractal singularity spectra to those of the component stocks that form this index reflects the varying degree of correlations involved among the stocks.Comment: 26 pages, 10 figure
    corecore