Search CORE

33,684 research outputs found

The computation of pitch with vectors

Author: Aluizio Arcela
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Hybrid Method for Digits Recognition using Fixed-Frame Scores and Derived Pitch

Author: B. R. Wildermoth
H. Abdulla W
H. Sakoe
M. J. Creany
M. Magimai-Doss
M. T. Hagan
N. M. Botros
S. Uma
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 11/12/2006
Field of study

This paper presents a procedure of frame normalization based on the traditional dynamic time warping (DTW) using the LPC coefficients. The redefined method is called as the DTW frame-fixing method (DTW-FF), it works by normalizing the word frames of the input against the reference frames. The enthusiasm to this study is due to neural network limitation that entails a fix number of input nodes for when processing multiple inputs in parallel. Due to this problem, this research is initiated to reduce the amount of computation and complexity in a neural network by reducing the number of inputs into the network. In this study, dynamic warping process is used, in which local distance scores of the warping path are fixed and collected so that their scores are of equal number of frames. Also studied in this paper is the consideration of pitch as a contributing feature to the speech recognition. Results showed a good performance and improvement when using pitch along with DTW-FF feature. The convergence rate between using the steepest gradient descent is also compared to another method namely conjugate gradient method. Convergence rate is also improved when conjugate gradient method is introduced in the back-propagation algorithm

Crossref

Universiti Teknologi Malaysia Institutional Repository

Fourier phase and pitch-class sum

Author: AJ Milne
C Callender
C Callender
D Tymoczko
D Tymoczko
D Tymoczko
D Tymoczko
E Amiot
I Quinn
J Yust
J Yust
J Yust
J Yust
James R Hughes
Jason Yust
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 13/06/2019
Field of study

Music theorists have proposed two very different geometric models of musical objects, one based on voice leading and the other based on the Fourier transform. On the surface these models are completely different, but they converge in special cases, including many geometries that are of particular analytical interest.Accepted manuscrip

Crossref

Boston University Institutional Repository (OpenBU)

A Novel Method For Speech Segmentation Based On Speakers' Characteristics

Author: Abdolali Behrouz
Sameti Hossein
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 01/04/2012
Field of study

Speech Segmentation is the process change point detection for partitioning an input audio stream into regions each of which corresponds to only one audio source or one speaker. One application of this system is in Speaker Diarization systems. There are several methods for speaker segmentation; however, most of the Speaker Diarization Systems use BIC-based Segmentation methods. The main goal of this paper is to propose a new method for speaker segmentation with higher speed than the current methods - e.g. BIC - and acceptable accuracy. Our proposed method is based on the pitch frequency of the speech. The accuracy of this method is similar to the accuracy of common speaker segmentation methods. However, its computation cost is much less than theirs. We show that our method is about 2.4 times faster than the BIC-based method, while the average accuracy of pitch-based method is slightly higher than that of the BIC-based method.Comment: 14 pages, 8 figure

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Automatic Detection of Laryngeal Pathology on Sustained Vowels Using Short-Term Cepstral Parameters: Analysis of Performance and Theoretical Justification

Author: B. Boyanov
B. Boyanov
J.G. Proakis
J.I. Godino-Llorente
J.I. Godino-Llorente
J.I. Godino-Llorente
J.R. Deller
L. Rabiner
P.J. Murphy
R.O. Duda
S. Haykin
S.E. Bou-Ghazale
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

The majority of speech signal analysis procedures for automatic detection of laryngeal pathologies mainly rely on parameters extracted from time domain processing. Moreover, calculation of these parameters often requires prior pitch period estimation; therefore, their validity heavily depends on the robustness of pitch detection. Within this paper, an alternative approach based on cepstral- domain processing is presented which has the advantage of not requiring pitch estimation, thus providing a gain in both simplicity and robustness. While the proposed scheme is similar to solutions based on Mel-frequency cepstral parameters, already present in literature, it has an easier physical interpretation while achieving similar performance standards

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

GOES-I/M ascent maneuvers from transfer orbit to station

Author: Abeyagunawardene S.
Defazio R.
Devlin S.
Elkin D.
Publication venue
Publication date
Field of study

The Geostationary Operational Environmental Satellite (GOES)-I/M station acquisition sequence consists nominally of three in-plane/out-of-plane maneuvers at apogee on the line of relative nodes and a small in-plane maneuver at perigee. Existing software to determine maneuver attitude, ignition time, and burn duration required modification to optimize the out-of-plane parts and admit the noninertial, three-axis stabilized attitude. The Modified Multiple Impulse Station Acquisition Maneuver Planning Program (SENARIO2) was developed from its predecessor, SCENARIO, to optimize the out-of-plane components of the impulsive delta-V vectors. Additional new features include commputation of short term J sub 2 perturbations and output of all premaneuver and postmaneuver orbit elements, coarse maneuver attitudes, propellant usage, spacecraft antenna aspect angles, and ground station coverage. The output data are intended to be used in the launch window computation and by the maneuver targeting computation (General Maneuver (GMAN) Program) software. The maneuver targeting computation in GMAN was modified to admit the GOES-I/M maneuver attitude. Appropriate combinations of ignition time, burn duration, and attitude enable any reasonable target orbit to be achieved

NASA Technical Reports Server

Efficient Bayesian inference for harmonic models via adaptive posterior factorization

Author: Plumbley MD
Vincent E
Publication venue: 'Elsevier BV'
Publication date: 01/01/2007
Field of study

NOTICE: this is the author’s version of a work that was accepted for publication in Neurocomputing. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in NEUROCOMPUTING, [VOL72, ISSUE 1-3, (2008)] DOI10.1016/j.neucom.2007.12.05

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

HAL Descartes

Queen Mary Research Online

Surrey Research Insight

Hal-Diderot

HAL-Rennes 1