Speaker verification using sequence discriminant support vector machines
This paper presents a text-independent speaker verification system using support vector machines (SVMs) with score-space kernels. Score-space kernels generalize Fisher kernels and are based on underlying generative models such as Gaussian mixture models (GMMs). This approach provides direct discrimination between whole sequences, in contrast with the frame-level approaches at the heart of most current systems. The resultant SVMs have a very high dimensionality, since the dimensionality is related to the number of parameters in the underlying generative model. To address problems that arise in the resultant optimization, we introduce a technique called spherical normalization that preconditions the Hessian matrix. We have performed speaker verification experiments using the PolyVar database. The SVM system presented here achieves a 34% relative reduction in error rate compared to a GMM likelihood ratio system.
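As an illustration of how a generative model can turn a whole variable-length sequence into one fixed-length vector, the following is a minimal sketch of a Fisher score for a one-dimensional GMM, taking the gradient of the sequence log-likelihood with respect to the component means only. It is not the paper's system: the function names, the restriction to means, and the one-dimensional features are assumptions made for brevity, and spherical normalization is not shown.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def fisher_score_means(sequence, weights, means, variances):
    """Gradient of the sequence log-likelihood w.r.t. each component mean.

    The result has one entry per mixture component, whatever the length
    of the input sequence. (Hypothetical helper; no underflow protection.)
    """
    score = [0.0] * len(means)
    for x in sequence:
        # Joint probability of each component and the observation.
        joint = [w * gaussian_pdf(x, m, v)
                 for w, m, v in zip(weights, means, variances)]
        total = sum(joint)
        for k, (m, v) in enumerate(zip(means, variances)):
            posterior = joint[k] / total  # component responsibility
            score[k] += posterior * (x - m) / v
    return score


# Two sequences of different lengths map to vectors of the same size.
short = fisher_score_means([0.1, 0.3], [0.5, 0.5], [0.0, 2.0], [1.0, 1.0])
long_ = fisher_score_means([0.1, 0.3, 1.9, 2.2, 0.0], [0.5, 0.5], [0.0, 2.0], [1.0, 1.0])
```

Vectors of this kind can then be passed to a standard SVM, and their size grows with the number of model parameters, which is the source of the high dimensionality the abstract mentions.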
Language Identification Using Visual Features
Automatic visual language identification (VLID) is the technology of using information derived from the visual appearance and movement of the speech articulators to identify the language being spoken, without the use of any audio information. This technique for language identification (LID) is useful in situations in which conventional audio processing is ineffective (very noisy environments), or impossible (no audio signal is available). Research in this field is also beneficial in the related field of automatic lip-reading. This paper introduces several methods for visual language identification (VLID). They are based upon audio LID techniques, which exploit language phonology and phonotactics to discriminate languages. We show that VLID is possible in a speaker-dependent mode by discriminating different languages spoken by an individual, and we then extend the technique to speaker-independent operation, taking pains to ensure that discrimination is not due to artefacts, either visual (e.g. skin-tone) or audio (e.g. rate of speaking). Although the low accuracy of visual speech recognition currently limits the performance of VLID, we can obtain an error-rate of < 10% in discriminating between Arabic and English on 19 speakers and using about 30s of visual speech.
Fast Load Balancing Approach for Growing Clusters by Bioinformatics
This paper presents a fast load balancing technique inspired by bioinformatics,
applied to the special case of assigning a patient to a specialist physician
cluster in real time. The work is framed as a soft presentation of the Gaussian
mixture model based on the extracted features supplied by patients. Based on
the likelihood ratio test, the patient is assigned to a specialist physician
cluster. The presented algorithms efficiently handle any size and any number
of incoming patient requests and rapidly place them in the specialist
physician cluster. Hence the technique smoothly balances the traffic load of
patients even in hazardous situations such as natural calamities. Simulation
results are presented for specialist physician clusters of variable size and
address the issue of a randomly growing number of patients.
Comment: 4 pages, 2 figures
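The assignment step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: each specialist cluster is modelled as a single weighted Gaussian over one scalar feature, and the cluster names, parameters, and the `assign_cluster` helper are hypothetical.

```python
import math


def log_gaussian(x, mean, var):
    """Log-density of a 1-D Gaussian."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def assign_cluster(feature, clusters):
    """Pick the cluster with the highest weighted log-likelihood.

    clusters maps a name to (weight, mean, variance). The second return
    value is the log-likelihood ratio between the best cluster and the
    runner-up; with two clusters this is a likelihood ratio test with
    threshold zero.
    """
    scores = {name: math.log(w) + log_gaussian(feature, m, v)
              for name, (w, m, v) in clusters.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    best = ranked[0]
    margin = scores[best] - scores[ranked[1]] if len(ranked) > 1 else float("inf")
    return best, margin


# Hypothetical clusters and patient features, for illustration only.
clusters = {
    "cardiology": (0.5, 0.0, 1.0),
    "neurology": (0.5, 4.0, 1.0),
}
first_choice, first_margin = assign_cluster(0.5, clusters)
second_choice, second_margin = assign_cluster(3.9, clusters)
```

Each request costs one pass over the clusters, so the per-patient work grows linearly with the number of clusters and does not depend on how many patients have already been placed.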