Search CORE

25 research outputs found

Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions.

Author: Al-Nasheri Ahmed
Ali Zulfiqar
Alsulaiman Mansour
Muhammad Ghulam
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Automatic voice pathology detection and classification systems effectively contribute to the assessment of voice disorders, which helps clinicians to detect the existence of any voice pathologies and the type of pathology from which patients suffer in the early stages. This work concentrates on developing an accurate and robust feature extraction for detecting and classifying voice pathologies by investigating different frequency bands using correlation functions. In this paper, we extracted maximum peak values and their corresponding lag values from each frame of a voiced signal by using correlation functions as features to detect and classify pathological samples. These features are investigated in different frequency bands to see the contribution of each band on the detection and classification processes.Various samples of sustained vowel /a/ of normal and pathological voices were extracted from three different databases: English, German, and Arabic. A support vector machine was used as a classifier. We also performed a t test to investigate the significant differences in mean of normal and pathological samples.The best achieved accuracies in both detection and classification were varied depending on the band, the correlation function, and the database. The most contributive bands in both detection and classification were between 1000 and 8000 Hz. In detection, the highest acquired accuracies when using cross-correlation were 99.809%, 90.979%, and 91.168% in the Massachusetts Eye and Ear Infirmary, Saarbruecken Voice Database, and Arabic Voice Pathology Database databases, respectively. However, in classification, the highest acquired accuracies when using cross-correlation were 99.255%, 98.941%, and 95.188% in the three databases, respectively

University of Essex Research Repository

Crossref

Ulster University's Research Portal

Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions

Author: Al-Nasheri Ahmed
Ali Zulfiqar
Alsulaiman Mansour
Farahat Ibrahim Mohamed
Malki Khalid H.
Mesallam Tamer A.
Muhammad Ghulam
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/04/2017
Field of study

Crossref

Ulster University's Research Portal

Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms

Author: Al-nasheri Ahmed
Ali Zulfiqar
Alsulaiman Mansour
Farahat Mohamed
Malki Khalid H
Mesallam Tamer A
Muhammad Ghulam
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2017
Field of study

A voice disorder database is an essential element in doing research on automatic voice disorder detection and classification. Ethnicity affects the voice characteristics of a person, and so it is necessary to develop a database by collecting the voice samples of the targeted ethnic group. This will enhance the chances of arriving at a global solution for the accurate and reliable diagnosis of voice disorders by understanding the characteristics of a local group. Motivated by such idea, an Arabic voice pathology database (AVPD) is designed and developed in this study by recording three vowels, running speech, and isolated words. For each recorded samples, the perceptual severity is also provided which is a unique aspect of the AVPD. During the development of the AVPD, the shortcomings of different voice disorder databases were identified so that they could be avoided in the AVPD. In addition, the AVPD is evaluated by using six different types of speech features and four types of machine learning algorithms. The results of detection and classification of voice disorders obtained with the sustained vowel and the running speech are also compared with the results of an English-language disorder database, the Massachusetts Eye and Ear Infirmary (MEEI) database

University of Essex Research Repository

Crossref

Directory of Open Access Journals

Ulster University's Research Portal

An Investigation of Multidimensional Voice Program Parameters in Three Different Databases for Voice Pathology Detection and Classification

Author: Al-nasheri Ahmed
Ali Zulfiqar
Alsulaiman Mansour
Bencherif Mohamed A
Farahat Mohamed
Malki Khalid H
Mesallam Tamer A
Muhammad Ghulam
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Background and Objective Automatic voice-pathology detection and classification systems may help clinicians to detect the existence of any voice pathologies and the type of pathology from which patients suffer in the early stages. The main aim of this paper is to investigate Multidimensional Voice Program (MDVP) parameters to automatically detect and classify the voice pathologies in multiple databases, and then to find out which parameters performed well in these two processes. Materials and Methods Samples of the sustained vowel /a/ of normal and pathological voices were extracted from three different databases, which have three voice pathologies in common. The selected databases in this study represent three distinct languages: (1) the Arabic voice pathology database; (2) the Massachusetts Eye and Ear Infirmary database (English database); and (3) the Saarbruecken Voice Database (German database). A computerized speech lab program was used to extract MDVP parameters as features, and an acoustical analysis was performed. The Fisher discrimination ratio was applied to rank the parameters. A t test was performed to highlight any significant differences in the means of the normal and pathological samples. Results The experimental results demonstrate a clear difference in the performance of the MDVP parameters using these databases. The highly ranked parameters also differed from one database to another. The best accuracies were obtained by using the three highest ranked MDVP parameters arranged according to the Fisher discrimination ratio: these accuracies were 99.68%, 88.21%, and 72.53% for the Saarbruecken Voice Database, the Massachusetts Eye and Ear Infirmary database, and the Arabic voice pathology database, respectively

University of Essex Research Repository

Crossref

Ulster University's Research Portal

Intra- and Inter-database Study for Arabic, English, and German Databases:Do Conventional Speech Features Detect Voice Pathology?

Author: Al-nasheri Ahmed
Ali Zulfiqar
Alsulaiman Mansour
Elamvazuthi Irraivan
Farahat Mohamed
Malki Khalid H.
Mesallam Tamer A.
Muhammad Ghulam
Publication venue: 'Elsevier BV'
Publication date: 01/05/2017
Field of study

A large population around the world has voice complications. Various approaches for subjective and objective evaluations have been suggested in the literature. The subjective approach strongly depends on the experience and area of expertise of a clinician, and human error cannot be neglected. On the other hand, the objective or automatic approach is noninvasive. Automatic developed systems can provide complementary information that may be helpful for a clinician in the early screening of a voice disorder. At the same time, automatic systems can be deployed in remote areas where a general practitioner can use them and may refer the patient to a specialist to avoid complications that may be life threatening. Many automatic systems for disorder detection have been developed by applying different types of conventional speech features such as the linear prediction coefficients, linear prediction cepstral coefficients, and Mel-frequency cepstral coefficients (MFCCs). This study aims to ascertain whether conventional speech features detect voice pathology reliably, and whether they can be correlated with voice quality. To investigate this, an automatic detection system based on MFCC was developed, and three different voice disorder databases were used in this study. The experimental results suggest that the accuracy of the MFCC-based system varies from database to database. The detection rate for the intra-database ranges from 72% to 95%, and that for the inter-database is from 47% to 82%. The results conclude that conventional speech features are not correlated with voice, and hence are not reliable in pathology detection

University of Essex Research Repository

Crossref

Ulster University's Research Portal

Voice pathology detection using interlaced derivative pattern on glottal source excitation

Author: Al-nasheri Ahmed
Ali Zulfiqar
Alsulaiman Mansour
Bencherif Mohamed A.
Farahat Mohamed
Malki Khalid H.
Mesallam Tamer A.
Muhammad Ghulam
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Crossref

Ulster University's Research Portal

A Generic Model for End State Prediction of Business Processes Towards Target Compliance

Author: A Al-Nasheri
H Zhang
IH Witten
L Wang
M Rosing Von
PN Taylor
S Menard
SA Alasadi
TA Mesallam
W Aalst
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/11/2019
Field of study

Crossref

Ulster University's Research Portal

An intelligent healthcare system for detection and classification to discriminate vocal fold disorders

Author: Aguilera
Al-nasheri
Ali
Ali
Ali
Arias-Londoño
Arjmandi
Arun Kumar Sangaiah
Bhattacharyya
Bishop
Brinca
Cannito
Cohen
Cordeiro
Dhingra
Falk
Fontes
Ghulam Muhammad
Harris
Hossain
Hossain
Hossain
Hossain
Hu
Jiang
Karnell
Leonard
M. Shamim Hossain
Malki
Malki
Markaki
Martins
Massachusette Eye & Ear Infirmry Voice & Speech LAB
Mau
Mehmood
Muhammad
Muhammad
Muhammad
Muhammad
Muhammad
Muhammad
Muhammad
Parsa
Redner
Rosenthal
Roy
Selim
Wang
Werth
Yang
Zhang
Zulfiqar Ali
Zwicker
Publication venue: 'Elsevier BV'
Publication date: 01/08/2018
Field of study

The growing population of senior citizens around the world will appear as a big challenge in the future and they will engage a significant portion of the healthcare facilities. Therefore, it is necessary to develop intelligent healthcare systems so that they can be deployed in smart homes and cities for remote diagnosis. To overcome the problem, an intelligent healthcare system is proposed in this study. The proposed intelligent system is based on the human auditory mechanism and capable of detection and classification of various types of the vocal fold disorders. In the proposed system, critical bandwidth phenomena by using the bandpass filters spaced over Bark scale is implemented to simulate the human auditory mechanism. Therefore, the system acts like an expert clinician who can evaluate the voice of a patient by auditory perception. The experimental results show that the proposed system can detect the pathology with an accuracy of 99.72%. Moreover, the classification accuracy for vocal fold polyp, keratosis, vocal fold paralysis, vocal fold nodules, and adductor spasmodic dysphonia is 97.54%, 99.08%, 96.75%, 98.65%, 95.83%, and 95.83%, respectively. In addition, an experiment for paralysis versus all other disorders is also conducted, and an accuracy of 99.13% is achieved. The results show that the proposed system is accurate and reliable in vocal fold disorder assessment and can be deployed successfully for remote diagnosis. Moreover, the performance of the proposed system is better as compared to existing disorder assessment systems

University of Essex Research Repository

Crossref

Ulster University's Research Portal

Vocal Acoustic Analysis – Classification of Dysphonic Voices with Artificial Neural Networks

Author: Al-nasheri
Bielamowicz
Forero
Guyon
Henríquez
Markaki
Panek
Pylypowich
Salhi
Sellam
Teixeira
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Crossref

Communications in Computer and Information Science

Author: A Al-nasheri
A Al-nasheri
AJ Orozco-Naranjo
D Hemmerling
D Martínez
G Muhammad
G Muhammad
G Muhammad
J Béranger
JL Semmlow
L Verde
LD Shriberg
MD Lytras
P Parascandolo
RJ Schalkoff
RJ Schilling
RM Summers
S Ibrahim
S Shinohara
UR Acharya
V Tscharner von
VN Vapnik
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Crossref

Repositorio Institucional ITM