16 research outputs found
Length analysis of speech to be recorded in the recognition of Parkinson's disease
Parkinson's disease is an incurable neurodegenerative disease to the present clinical knowledge. It is diagnosed mostly by exclusion tests. Numerous studies have confirmed that speech can be promising to suspect the presence of the disease. On the other hand, just a few researches discuss the appropriate length of the speech sample or the contribution of parts of the full-length recordings in the classification. Hence, we partitioned each original recording into four shorter samples. We trained linear and radial basis function (rbf) kernel Support Vector Machine (SVM) models separately for original recordings, each partitioned group and all partitioned samples together. We found no significant difference between the results of the rbf kernel models. However, we obtained significantly better results with a portion of the entire speech using linear kernel models. In conclusion, even a shorter piece of a longer speech may be adequate for classification
CNN AND LSTM FOR THE CLASSIFICATION OF PARKINSON'S DISEASE BASED ON THE GTCC AND MFCC
Parkinson's disease is a recognizable clinical syndrome with a variety of causes and clinical presentations; it represents a rapidly growing neurodegenerative disorder. Since about 90 percent of Parkinson's disease sufferers have some form of early speech impairment, recent studies on tele diagnosis of Parkinson's disease have focused on the recognition of voice impairments from vowel phonations or the subjects' discourse. In this paper, we present a new approach for Parkinson's disease detection from speech sounds that are based on CNN and LSTM and uses two categories of characteristics Mel Frequency Cepstral Coefficients (MFCC) and Gammatone Cepstral Coefficients (GTCC) obtained from noise-removed speech signals with comparative EMD-DWT and DWT-EMD analysis. The proposed model is divided into three stages. In the first step, noise is removed from the signals using the EMD-DWT and DWT-EMD methods. In the second step, the GTCC and MFCC are extracted from the enhanced audio signals. The classification process is carried out in the third step by feeding these features into the LSTM and CNN models, which are designed to define sequential information from the extracted features. The experiments are performed using PC-GITA and Sakar datasets and 10-fold cross validation method, the highest classification accuracy for the Sakar dataset reached 100% for both EMD-DWT-GTCC-CNN and DWT-EMD-GTCC-CNN, and for the PC-GITA dataset, the accuracy is reached 100% for EMD-DWT-GTCC-CNN and 96.55% for DWT-EMD-GTCC-CNN. The results of this study indicate that the characteristics of GTCC are more appropriate and accurate for the assessment of PD than MFCC
A Review of the Assessment Methods of Voice Disorders in the Context of Parkinson's Disease
In recent years, a significant progress in the field of research dedicated to the treatment of disabilities has been witnessed. This is particularly true for neurological diseases, which generally influence the system that controls the execution of learned motor patterns. In addition to its importance for communication with the outside world and interaction with others, the voice is a reflection of our personality, moods and emotions. It is a way to provide information on health status, shape, intentions, age and even the social environment. It is also a working tool for many, but an important element of life for all. Patients with Parkinsonβs disease (PD) are numerous and they suffer from hypokinetic dysarthria, which is manifested in all aspects of speech production: respiration, phonation, articulation, nasalization and prosody. This paper provides a review of the methods of the assessment of speech disorders in the context of PD and also discusses the limitations
It diagnostics of parkinson's disease based on voice markers and decreased motor activity
The objectives of the article to propose the method for complex recognition of Parkinson's disease using machine learning, based on markers of voice analysis and changes in patient movements on known data sets. The time-frequency function, (the wavelet function) and the Meyer kepstral coefficient function are used. The KNN algorithm and the algorithm of a two-layer neural network were used for training and testing on publicly available datasets on speech changes and motion retardation in Parkinson's disease. A Bayesian optimizer was also used to improve the hyperparameters of the KNN algorithm. The constructed models achieved an accuracy of 94.7 % and 96.2 % on a data set on speech changes in patients with Parkinson's disease and a data set on slowing down the movement of patients, respectively. The recognition results are close to the world level. The proposed technique is intended for use in the subsystem of IT diagnostics of nervous diseases
Recognition of signs of Parkinson's disease based on the analysis of voice markers and motor activity
Π Π΅ΡΠ°Π΅ΡΡΡ Π·Π°Π΄Π°ΡΠ° ΠΠ’-Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠΈ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° ΠΏΠΎ Π°Π½Π°Π»ΠΈΠ·Ρ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ Π³ΠΎΠ»ΠΎΡΠ°
ΠΈ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ². ΠΠΊΡΡΠ°Π»ΡΠ½ΠΎΡΡΡ Π·Π°Π΄Π°ΡΠΈ ΡΠ²ΡΠ·Π°Π½Π° Ρ Π½Π΅ΠΎΠ±Ρ
ΠΎΠ΄ΠΈΠΌΠΎΡΡΡΡ ΡΠ°Π½Π½Π΅ΠΉ Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠΈ
Π·Π°Π±ΠΎΠ»Π΅Π²Π°Π½ΠΈΡ. ΠΡΠ΅Π΄Π»Π°Π³Π°Π΅ΡΡΡ ΠΌΠ΅ΡΠΎΠ΄ ΠΊΠΎΠΌΠΏΠ»Π΅ΠΊΡΠ½ΠΎΠ³ΠΎ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ΠΌ ΠΌΠ°ΡΠΈΠ½Π½ΠΎΠ³ΠΎ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ, ΠΎΡΠ½ΠΎΠ²Π°Π½Π½ΡΠΉ Π½Π° Π°Π½Π°Π»ΠΈΠ·Π΅ Π³ΠΎΠ»ΠΎΡΠΎΠ²ΡΡ
ΠΌΠ°ΡΠΊΠ΅ΡΠΎΠ² ΠΈ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΠΉ Π² Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡΡ
ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ² Π½Π° ΠΈΠ·Π²Π΅ΡΡΠ½ΡΡ
Π½Π°Π±ΠΎΡΠ°Ρ
Π΄Π°Π½Π½ΡΡ
. ΠΠ΅ΡΠΎΠ΄Ρ. ΠΡΠΏΠΎΠ»ΡΠ·ΡΡΡΡΡ ΡΠ°ΡΡΠΎΡΠ½ΠΎ-Π²ΡΠ΅ΠΌΠ΅Π½Π½Π°Ρ ΡΡΠ½ΠΊΡΠΈΡ (ΡΡΠ½ΠΊΡΠΈΡ Π²Π΅ΠΉΠ²Π»Π΅ΡΠ°), ΡΡΠ½ΠΊΡΠΈΡ ΠΊΠ΅ΠΏΡΡΡΠ°Π»ΡΠ½ΠΎΠ³ΠΎ ΠΊΠΎΡΡΡΠΈΡΠΈΠ΅Π½ΡΠ° ΠΠ΅ΠΉΠ΅ΡΠ°, Π°Π»Π³ΠΎΡΠΈΡΠΌ k-Π±Π»ΠΈΠΆΠ°ΠΉΡΠΈΡ
ΡΠΎΡΠ΅Π΄Π΅ΠΉ (k-Nearest Neighbors, KNN), Π°Π»Π³ΠΎΡΠΈΡΠΌ Π΄Π²ΡΡ
ΡΠ»ΠΎΠΉΠ½ΠΎΠΉ Π½Π΅ΠΉΡΠΎΠ½Π½ΠΎΠΉ ΡΠ΅ΡΠΈ Π΄Π»Ρ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ ΠΈ ΡΠ΅ΡΡΠΈΡΠΎΠ²Π°Π½ΠΈΡ Π½Π° ΠΎΠ±ΡΠ΅Π΄ΠΎΡΡΡΠΏΠ½ΡΡ
Π½Π°Π±ΠΎΡΠ°Ρ
Π΄Π°Π½Π½ΡΡ
ΠΏΠΎ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ ΡΠ΅ΡΠΈ ΠΈ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΡΠΈ Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π°, Π° ΡΠ°ΠΊΠΆΠ΅ Π±Π°ΠΉΠ΅ΡΠΎΠ²ΡΠΊΠΈΠΉ ΠΎΠΏΡΠΈΠΌΠΈΠ·Π°ΡΠΎΡ Π΄Π»Ρ ΡΠ»ΡΡΡΠ΅Π½ΠΈΡ Π³ΠΈ-
ΠΏΠ΅ΡΠΏΠ°ΡΠ°ΠΌΠ΅ΡΡΠΎΠ² Π°Π»Π³ΠΎΡΠΈΡΠΌΠ° KNN. Π Π΅Π·ΡΠ»ΡΡΠ°ΡΡ. ΠΠ»Π³ΠΎΡΠΈΡΠΌ KNN ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ Π΄Π»Ρ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΡΠ΅ΡΠΈ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ², ΡΠΎΡΠ½ΠΎΡΡΡ ΡΠ΅ΡΡΠ° 94,7 % Π΄ΠΎΡΡΠΈΠ³Π½ΡΡΠ° ΠΏΡΠΈ Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠ΅ Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° ΠΏΠΎ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ Π³ΠΎΠ»ΠΎΡΠ°. ΠΠ»Π³ΠΎΡΠΈΡΠΌ Π±Π°ΠΉΠ΅ΡΠΎΠ²ΡΠΊΠΎΠΉ Π½Π΅ΠΉΡΠΎΠ½Π½ΠΎΠΉ ΡΠ΅ΡΠΈ ΠΏΡΠΈΠΌΠ΅Π½Π΅Π½ Π΄Π»Ρ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ², ΠΎΠ½ Π΄Π°Π» ΡΠΎΡΠ½ΠΎΡΡΡ ΡΠ΅ΡΡΠ° 96,2 %. ΠΠ°ΠΊΠ»ΡΡΠ΅Π½ΠΈΠ΅. ΠΠΎΠ»ΡΡΠ΅Π½Π½ΡΠ΅ ΡΠ΅Π·ΡΠ»ΡΡΠ°ΡΡ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° Π±Π»ΠΈΠ·ΠΊΠΈ ΠΊ ΠΌΠΈΡΠΎΠ²ΠΎΠΌΡ ΡΡΠΎΠ²Π½Ρ. ΠΠ° ΡΠΎΠΌ ΠΆΠ΅ Π½Π°Π±ΠΎΡΠ΅ Π΄Π°Π½Π½ΡΡ
ΠΏΠΎ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ ΡΠ΅ΡΠΈ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ² ΠΎΠ΄ΠΈΠ½ ΠΈΠ· Π»ΡΡΡΠΈΡ
ΠΏΠΎΠΊΠ°Π·Π°ΡΠ΅Π»Π΅ΠΉ Π·Π°ΡΡΠ±Π΅ΠΆΠ½ΡΡ
ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠΉ ΡΠΎΡΡΠ°Π²Π»ΡΠ΅Ρ 95,8 %, Π° Π½Π° Π½Π°Π±ΠΎΡΠ΅ Π΄Π°Π½Π½ΡΡ
ΠΏΠΎ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ² β 98,8 %. ΠΡΠ΅Π΄Π»Π°Π³Π°Π΅ΠΌΠ°Ρ Π°Π²ΡΠΎΡΡΠΊΠ°Ρ ΠΌΠ΅ΡΠΎΠ΄ΠΈΠΊΠ° ΠΏΡΠ΅Π΄Π½Π°Π·Π½Π°ΡΠ΅Π½Π° Π΄Π»Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΡ Π² ΠΏΠΎΠ΄ΡΠΈΡΡΠ΅ΠΌΠ΅ ΠΠ’-Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠΈ Π½Π΅Π²ΡΠΎΠ»ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΈΡ
Π·Π°Π±ΠΎΠ»Π΅Π²Π°Π½ΠΈΠΉ ΡΠΌΠ½ΠΎΠ³ΠΎ Π³ΠΎΡΠΎΠ΄Π°
Π Π°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΠ΅ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° Π½Π° ΠΎΡΠ½ΠΎΠ²Π΅ Π°Π½Π°Π»ΠΈΠ·Π° Π³ΠΎΠ»ΠΎΡΠΎΠ²ΡΡ ΠΌΠ°ΡΠΊΠ΅ΡΠΎΠ² ΠΈ Π΄Π²ΠΈΠ³Π°ΡΠ΅Π»ΡΠ½ΠΎΠΉ Π°ΠΊΡΠΈΠ²Π½ΠΎΡΡΠΈ
Objectives. The problem of IT diagnostics of signs of Parkinson's disease is solved by analyzing changes in the voice and slowing down the movement of patients. The urgency of the task is associated with the need for early diagnosis of the disease. A method of complex recognition of Parkinson's disease using machine learning is proposed, based on markers of voice analysis and changes in the patient's movements on known data sets.Methods. The time-frequency function (the wavelet function) and the Meyer kepstral coefficient function, the KNN algorithm (k-Nearest Neighbors, KNN) and the algorithm of a two-layer neural network are used for training and testing on publicly available datasets on speech changes and motion retardation in Parkinson's disease. A Bayesian optimizer is also used to improve the hyperparameters of the KNN algorithm.Results. The KNN algorithm was used for speech recognition of patients, the test accuracy of 94.7% was achieved in the diagnosis of Parkinson's disease by voice change. The Bayesian neural network algorithm was applied to recognize the slowing down of the patients' movements, it gave a test accuracy of 96.2% for the diagnosis of Parkinson's disease.Conclusion. The obtained results of recognition of signs of Parkinson's disease are close to the world level. On the same set of data on speech changes of patients, one of the best indicators of foreign studies is 95.8%. On the same set of data on motion deceleration, one of the best indicators of foreign researchers is 98.8%. The proposed author's technique is intended for use in the subsystem of IT diagnostics of neurological diseases of a Smart city.Π¦Π΅Π»ΠΈ. Π Π΅ΡΠ°Π΅ΡΡΡ Π·Π°Π΄Π°ΡΠ° ΠΠ’-Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠΈ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° ΠΏΠΎ Π°Π½Π°Π»ΠΈΠ·Ρ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ Π³ΠΎΠ»ΠΎΡΠ° ΠΈ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ². ΠΠΊΡΡΠ°Π»ΡΠ½ΠΎΡΡΡ Π·Π°Π΄Π°ΡΠΈ ΡΠ²ΡΠ·Π°Π½Π° Ρ Π½Π΅ΠΎΠ±Ρ
ΠΎΠ΄ΠΈΠΌΠΎΡΡΡΡ ΡΠ°Π½Π½Π΅ΠΉ Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠΈ Π·Π°Π±ΠΎΠ»Π΅Π²Π°Π½ΠΈΡ. ΠΡΠ΅Π΄Π»Π°Π³Π°Π΅ΡΡΡ ΠΌΠ΅ΡΠΎΠ΄ ΠΊΠΎΠΌΠΏΠ»Π΅ΠΊΡΠ½ΠΎΠ³ΠΎ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ΠΌ ΠΌΠ°ΡΠΈΠ½Π½ΠΎΠ³ΠΎ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ, ΠΎΡΠ½ΠΎΠ²Π°Π½Π½ΡΠΉ Π½Π° Π°Π½Π°Π»ΠΈΠ·Π΅ Π³ΠΎΠ»ΠΎΡΠΎΠ²ΡΡ
ΠΌΠ°ΡΠΊΠ΅ΡΠΎΠ² ΠΈ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΠΉ Π² Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡΡ
ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ² Π½Π° ΠΈΠ·Π²Π΅ΡΡΠ½ΡΡ
Π½Π°Π±ΠΎΡΠ°Ρ
Π΄Π°Π½Π½ΡΡ
.ΠΠ΅ΡΠΎΠ΄Ρ. ΠΡΠΏΠΎΠ»ΡΠ·ΡΡΡΡΡ ΡΠ°ΡΡΠΎΡΠ½ΠΎ-Π²ΡΠ΅ΠΌΠ΅Π½Π½Π°Ρ ΡΡΠ½ΠΊΡΠΈΡ (ΡΡΠ½ΠΊΡΠΈΡ Π²Π΅ΠΉΠ²Π»Π΅ΡΠ°), ΡΡΠ½ΠΊΡΠΈΡ ΠΊΠ΅ΠΏΡΡΡΠ°Π»ΡΠ½ΠΎΠ³ΠΎ ΠΊΠΎΡΡΡΠΈΡΠΈΠ΅Π½ΡΠ° ΠΠ΅ΠΉΠ΅ΡΠ°, Π°Π»Π³ΠΎΡΠΈΡΠΌ k-Π±Π»ΠΈΠΆΠ°ΠΉΡΠΈΡ
ΡΠΎΡΠ΅Π΄Π΅ΠΉ (k-Nearest Neighbors, KNN), Π°Π»Π³ΠΎΡΠΈΡΠΌ Π΄Π²ΡΡ
ΡΠ»ΠΎΠΉΠ½ΠΎΠΉ Π½Π΅ΠΉΡΠΎΠ½Π½ΠΎΠΉ ΡΠ΅ΡΠΈ Π΄Π»Ρ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ ΠΈ ΡΠ΅ΡΡΠΈΡΠΎΠ²Π°Π½ΠΈΡ Π½Π° ΠΎΠ±ΡΠ΅Π΄ΠΎΡΡΡΠΏΠ½ΡΡ
Π½Π°Π±ΠΎΡΠ°Ρ
Π΄Π°Π½Π½ΡΡ
ΠΏΠΎ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ ΡΠ΅ΡΠΈ ΠΈ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΡΠΈ Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π°, Π° ΡΠ°ΠΊΠΆΠ΅ Π±Π°ΠΉΠ΅ΡΠΎΠ²ΡΠΊΠΈΠΉ ΠΎΠΏΡΠΈΠΌΠΈΠ·Π°ΡΠΎΡ Π΄Π»Ρ ΡΠ»ΡΡΡΠ΅Π½ΠΈΡ Π³ΠΈΠΏΠ΅ΡΠΏΠ°ΡΠ°ΠΌΠ΅ΡΡΠΎΠ² Π°Π»Π³ΠΎΡΠΈΡΠΌΠ° KNN.Π Π΅Π·ΡΠ»ΡΡΠ°ΡΡ. ΠΠ»Π³ΠΎΡΠΈΡΠΌ KNN ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ Π΄Π»Ρ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΡΠ΅ΡΠΈ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ², ΡΠΎΡΠ½ΠΎΡΡΡ ΡΠ΅ΡΡΠ° 94,7 % Π΄ΠΎΡΡΠΈΠ³Π½ΡΡΠ° ΠΏΡΠΈ Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠ΅ Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° ΠΏΠΎ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ Π³ΠΎΠ»ΠΎΡΠ°. ΠΠ»Π³ΠΎΡΠΈΡΠΌ Π±Π°ΠΉΠ΅ΡΠΎΠ²ΡΠΊΠΎΠΉ Π½Π΅ΠΉΡΠΎΠ½Π½ΠΎΠΉ ΡΠ΅ΡΠΈ ΠΏΡΠΈΠΌΠ΅Π½Π΅Π½ Π΄Π»Ρ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ², ΠΎΠ½ Π΄Π°Π» ΡΠΎΡΠ½ΠΎΡΡΡ ΡΠ΅ΡΡΠ° 96,2 %.ΠΠ°ΠΊΠ»ΡΡΠ΅Π½ΠΈΠ΅. ΠΠΎΠ»ΡΡΠ΅Π½Π½ΡΠ΅ ΡΠ΅Π·ΡΠ»ΡΡΠ°ΡΡ ΡΠ°ΡΠΏΠΎΠ·Π½Π°Π²Π°Π½ΠΈΡ ΠΏΡΠΈΠ·Π½Π°ΠΊΠΎΠ² Π±ΠΎΠ»Π΅Π·Π½ΠΈ ΠΠ°ΡΠΊΠΈΠ½ΡΠΎΠ½Π° Π±Π»ΠΈΠ·ΠΊΠΈ ΠΊ ΠΌΠΈΡΠΎΠ²ΠΎΠΌΡ ΡΡΠΎΠ²Π½Ρ. ΠΠ° ΡΠΎΠΌ ΠΆΠ΅ Π½Π°Π±ΠΎΡΠ΅ Π΄Π°Π½Π½ΡΡ
ΠΏΠΎ ΠΈΠ·ΠΌΠ΅Π½Π΅Π½ΠΈΡ ΡΠ΅ΡΠΈ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ² ΠΎΠ΄ΠΈΠ½ ΠΈΠ· Π»ΡΡΡΠΈΡ
ΠΏΠΎΠΊΠ°Π·Π°ΡΠ΅Π»Π΅ΠΉ Π·Π°ΡΡΠ±Π΅ΠΆΠ½ΡΡ
ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠΉ ΡΠΎΡΡΠ°Π²Π»ΡΠ΅Ρ 95,8 %, Π° Π½Π° Π½Π°Π±ΠΎΡΠ΅ Π΄Π°Π½Π½ΡΡ
ΠΏΠΎ Π·Π°ΠΌΠ΅Π΄Π»Π΅Π½ΠΈΡ Π΄Π²ΠΈΠΆΠ΅Π½ΠΈΡ ΠΏΠ°ΡΠΈΠ΅Π½ΡΠΎΠ² - 98,8 %. ΠΡΠ΅Π΄Π»Π°Π³Π°Π΅ΠΌΠ°Ρ Π°Π²ΡΠΎΡΡΠΊΠ°Ρ ΠΌΠ΅ΡΠΎΠ΄ΠΈΠΊΠ° ΠΏΡΠ΅Π΄Π½Π°Π·Π½Π°ΡΠ΅Π½Π° Π΄Π»Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΡ Π² ΠΏΠΎΠ΄ΡΠΈΡΡΠ΅ΠΌΠ΅ ΠΠ’-Π΄ΠΈΠ°Π³Π½ΠΎΡΡΠΈΠΊΠΈ Π½Π΅Π²ΡΠΎΠ»ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΈΡ
Π·Π°Π±ΠΎΠ»Π΅Π²Π°Π½ΠΈΠΉ ΡΠΌΠ½ΠΎΠ³ΠΎ Π³ΠΎΡΠΎΠ΄Π°
A Review on Human-Computer Interaction and Intelligent Robots
In the field of artificial intelligence, humanβcomputer interaction (HCI) technology and its related intelligent robot technologies are essential and interesting contents of research. From the perspective of software algorithm and hardware system, these above-mentioned technologies study and try to build a natural HCI environment. The purpose of this research is to provide an overview of HCI and intelligent robots. This research highlights the existing technologies of listening, speaking, reading, writing, and other senses, which are widely used in human interaction. Based on these same technologies, this research introduces some intelligent robot systems and platforms. This paper also forecasts some vital challenges of researching HCI and intelligent robots. The authors hope that this work will help researchers in the field to acquire the necessary information and technologies to further conduct more advanced research
Your Voice Gave You Away: the Privacy Risks of Voice-Inferred Information
Our voices can reveal intimate details about our lives. Yet, many privacy discussions have focused on the threats from speaker recognition and speech recognition. This Note argues that this focus overlooks another privacy risk: voice-inferred information. This term describes non-obvious information drawn from voice data through a combination of machine learning, artificial intelligence, data mining, and natural language processing. Companies have latched onto voiceinferred information. Early adopters have applied the technology in situations as varied as lending risk analysis and hiring. Consumers may balk at such strategies, but the current United States privacy regime leaves voice insights unprotected. By applying a notice and consent privacy model via sector-specific statutes, the hodgepodge of U.S. federal privacy laws allows voice-inferred information to slip through the regulatory cracks. This Note reviews the current legal landscape and identifies existing gaps. It then suggests two solutions that balance voice privacy with technological innovation: purpose-based consent and independent data review boards. The first bolsters voice protection within the traditional notice and consent framework, while the second imagines a new protective scheme. Together, these solutions complement each other to afford the human voice the protection it deserves
Privacy-Protecting Techniques for Behavioral Data: A Survey
Our behavior (the way we talk, walk, or think) is unique and can be used as a biometric trait. It also correlates with sensitive attributes like emotions. Hence, techniques to protect individuals privacy against unwanted inferences are required. To consolidate knowledge in this area, we systematically reviewed applicable anonymization techniques. We taxonomize and compare existing solutions regarding privacy goals, conceptual operation, advantages, and limitations. Our analysis shows that some behavioral traits (e.g., voice) have received much attention, while others (e.g., eye-gaze, brainwaves) are mostly neglected. We also find that the evaluation methodology of behavioral anonymization techniques can be further improved
Adaptation of Speaker and Speech Recognition Methods for the Automatic Screening of Speech Disorders using Machine Learning
This PhD thesis presented methods for exploiting the non-verbal communication of individuals suffering from specific diseases or health conditions aiming to reach an automatic screening of them. More specifically, we employed one of the pillars of non-verbal communication, paralanguage, to explore techniques that could be utilized to model the speech of subjects. Paralanguage is a non-lexical component of communication that relies on intonation, pitch, speed of talking, and others, which can be processed and analyzed in an automatic manner. This is called Computational Paralinguistics, which can be defined as the study of modeling non-verbal latent patterns within the speech of a speaker by means of computational algorithms; these patterns go beyond the linguistic} approach.
By means of machine learning, we present models from distinct scenarios of both paralinguistics and pathological speech which are capable of estimating the health status of a given disease such as Alzheimer's, Parkinson's, and clinical depression, among others, in an automatic manner