2,868 research outputs found
Spectral analysis of pathological acoustic speech waveforms
Biomedical engineering is the application of engineering principles and techniques to the medical field. The design and problem solving skills of engineering are combined with medical and biological science, which improves medical disorder diagnosis and treatment. The purpose of this study is to develop an automated procedure for detecting excessive jitter in speech signals, which is useful for differentiating normal from pathologic speech. The fundamental motivation for this research is that tools are needed by speech pathologists and laryngologists for use in the early detection and treatment of laryngeal disorders. Acoustical analysis of speech was performed to analyze various features of a speech signal. Earlier research established a relation between pitch period jitter and harmonic bandwidth. This concept was used for detecting laryngeal disorders in speech since pathologic speech has been found to have larger amounts of jitter than normal speech.
Our study was performed using vowel samples from the voice disorder database recorded at the Massachusetts Eye and Ear Infirmary (MEEI) in1994. The KAYPENTAX company markets this database. Software development was conducted using MATLAB, a user-friendly programming language which has been applied widely for signal processing. An algorithm was developed to compute harmonic bandwidths for various speech samples of sustained vowel sounds. Open and closed tests were conducted on 23 samples of pathologic and normal speech samples each. Classification results showed 69.56% probability of correct detection of pathologic speech samples during an open test
Fault Analysis of Electromechanical Systems using Information Entropy Concepts
Fault analysis of mechanical and electromechanical systems has been a subject of considerable interest in the systems and control research community. Entropy, under its various formulations is an important variable, which is unrivaled when it comes to measuring order (or organization) and/or disorder (or disorganization). Researchers have successfully used entropy based concepts to solve various challenging problems in engineering, mathematics, meteorology, biotechnology, medicine, statistics etc. This research tries to analyze faults in electromechanical systems using information entropy concepts. The objectives of this research are to develop a method to evaluate signal entropy of a dynamical system using only input/output measurements, and to use this entropy measure to analyze faults within a dynamical system. Given discrete-time signals corresponding to the three-phase voltages and currents of an electromechanical system being monitored, the problem is to analyze whether or not this system is healthy.
The concepts of Shannon entropy and relative entropy come from the field of Information Theory. They measure the degree of uncertainty that exists in a system. The main idea behind this approach is that the system's dynamics may have regularities hidden in measurements that are not obvious to see. The Shannon entropy and relative entropy measures are calculated by using probability distribution functions (PDF) that are formed by sampling the time series currents and voltages of a system. The system's health is monitored by, first, sampling the currents and voltages at certain time intervals, then generating the corresponding PDFs and, finally, calculating the information entropy measures. If the system dynamics are unchanged, or in other words, the system continues to be healthy, then the relative entropy measures will be consistently low or constant. But, if the system dynamics change due to damage, then the corresponding relative entropy and Shannon entropy measures will be increasing compared to the entropy of the system with less damage
Recommended from our members
Parallels in the sequential organization of birdsong and human speech.
Human speech possesses a rich hierarchical structure that allows for meaning to be altered by words spaced far apart in time. Conversely, the sequential structure of nonhuman communication is thought to follow non-hierarchical Markovian dynamics operating over only short distances. Here, we show that human speech and birdsong share a similar sequential structure indicative of both hierarchical and Markovian organization. We analyze the sequential dynamics of song from multiple songbird species and speech from multiple languages by modeling the information content of signals as a function of the sequential distance between vocal elements. Across short sequence-distances, an exponential decay dominates the information in speech and birdsong, consistent with underlying Markovian processes. At longer sequence-distances, the decay in information follows a power law, consistent with underlying hierarchical processes. Thus, the sequential organization of acoustic elements in two learned vocal communication signals (speech and birdsong) shows functionally equivalent dynamics, governed by similar processes
Analysis and correction of the helium speech effect by autoregressive signal processing
SIGLELD:D48902/84 / BLDSC - British Library Document Supply CentreGBUnited Kingdo
Models and analysis of vocal emissions for biomedical applications: 5th International Workshop: December 13-15, 2007, Firenze, Italy
The MAVEBA Workshop proceedings, held on a biannual basis, collect the scientific papers presented both as oral and poster contributions, during the conference. The main subjects are: development of theoretical and mechanical models as an aid to the study of main phonatory dysfunctions, as well as the biomedical engineering methods for the analysis of voice signals and images, as a support to clinical diagnosis and classification of vocal pathologies. The Workshop has the sponsorship of: Ente Cassa Risparmio di Firenze, COST Action 2103, Biomedical Signal Processing and Control Journal (Elsevier Eds.), IEEE Biomedical Engineering Soc. Special Issues of International Journals have been, and will be, published, collecting selected papers from the conference
Analysis and Detection of Pathological Voice using Glottal Source Features
Automatic detection of voice pathology enables objective assessment and
earlier intervention for the diagnosis. This study provides a systematic
analysis of glottal source features and investigates their effectiveness in
voice pathology detection. Glottal source features are extracted using glottal
flows estimated with the quasi-closed phase (QCP) glottal inverse filtering
method, using approximate glottal source signals computed with the zero
frequency filtering (ZFF) method, and using acoustic voice signals directly. In
addition, we propose to derive mel-frequency cepstral coefficients (MFCCs) from
the glottal source waveforms computed by QCP and ZFF to effectively capture the
variations in glottal source spectra of pathological voice. Experiments were
carried out using two databases, the Hospital Universitario Principe de
Asturias (HUPA) database and the Saarbrucken Voice Disorders (SVD) database.
Analysis of features revealed that the glottal source contains information that
discriminates normal and pathological voice. Pathology detection experiments
were carried out using support vector machine (SVM). From the detection
experiments it was observed that the performance achieved with the studied
glottal source features is comparable or better than that of conventional MFCCs
and perceptual linear prediction (PLP) features. The best detection performance
was achieved when the glottal source features were combined with the
conventional MFCCs and PLP features, which indicates the complementary nature
of the features
Models and analysis of vocal emissions for biomedical applications
This book of Proceedings collects the papers presented at the 3rd International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, MAVEBA 2003, held 10-12 December 2003, Firenze, Italy. The workshop is organised every two years, and aims to stimulate contacts between specialists active in research and industrial developments, in the area of voice analysis for biomedical applications. The scope of the Workshop includes all aspects of voice modelling and analysis, ranging from fundamental research to all kinds of biomedical applications and related established and advanced technologies
Maximum likelihood Linear Programming Data Fusion for Speaker Recognition
Biometric system performance can be improved by means of data fusion. Several kinds of
information can be fused in order to obtain a more accurate classification (identification or
verification) of an input sample. In this paper we present a method for computing the
weights in a weighted sum fusion for score combinations, by means of a likelihood model.
The maximum likelihood estimation is set as a linear programming problem. The scores are
derived from a GMM classifier working on a different feature extractor. Our experimental
results assesed the robustness of the system in front a changes on time (different sessions)
and robustness in front a change of microphone. The improvements obtained were
significantly better (error bars of two standard deviations) than a uniform weighted sum or a
uniform weighted product or the best single classifier. The proposed method scales
computationaly with the number of scores to be fussioned as the simplex method for linear
programming
- …