33 research outputs found
A unifying view for performance measures in multi-class prediction
In the last few years, many different performance measures have been
introduced to overcome the weakness of the most natural metric, the Accuracy.
Among them, Matthews Correlation Coefficient has recently gained popularity
among researchers not only in machine learning but also in several application
fields such as bioinformatics. Nonetheless, further novel functions are being
proposed in literature. We show that Confusion Entropy, a recently introduced
classifier performance measure for multi-class problems, has a strong
(monotone) relation with the multi-class generalization of a classical metric,
the Matthews Correlation Coefficient. Computational evidence in support of the
claim is provided, together with an outline of the theoretical explanation
Deep Learning Based Human Emotional State Recognition in a Video
Human emotions play significant role in everyday life. There are a lot of applications of automatic emotion recognition in medicine, e-learning, monitoring, marketing etc. In this paper the method and neural network architecture for real-time human emotion recognition by audio-visual data are proposed. To classify one of seven emotions, deep neural networks, namely, convolutional and recurrent neural networks are used. Visual information is represented by a sequence of 16 frames of 96 Γ 96 pixels, and audio information - by 140 features for each of a sequence of 37 temporal windows. To reduce the number of audio features autoencoder was used. Audio information in conjunction with visual one is shown to increase recognition accuracy up to 12%. The developed system being not demanding to be computing resources is dynamic in terms of selection of parameters, reducing or increasing the number of emotion classes, as well as the ability to easily add, accumulate and use information from other external devices for further improvement of classification accuracy
ΠΠ°ΡΠΈΠ°ΡΠΈΡ ΠΏΠΈΠ³ΠΌΠ΅Π½ΡΠ°ΡΠΈΠΈ ΡΠ°Π΄ΡΠΆΠΊΠΈ Π³Π»Π°Π· Π±Π΅Π»ΠΎΡΡΡΡΠΊΠΎΠΉ ΠΏΠΎΠΏΡΠ»ΡΡΠΈΠΈ Π² ΡΠ²ΡΠ·ΠΈ Ρ ΠΏΠΎΠ»ΠΈΠΌΠΎΡΡΠΈΠ·ΠΌΠΎΠΌ Π³Π΅Π½ΠΎΠ² HERC2 ΠΈ OCA2
The human genetic phenotyping is one of the most intensely developing area of forensic genetics. Externally visible traits, including eye color, can be predicted by analyzing single nucleotide polymorphisms with a high predictive rate. We studied the polymorphisms rs12913832 and rs1800407 in the HERC2 and OCA2 genes, respectively, to evaluate its prognostic availability in relation to the iris pigmentation of the Belarusian population. For this, both eye images and DNA samples were collected from 314 individuals to analyze the key polymorphisms by the TaqMan assay. Our data confirmed a relevance of rs12913832:A>G and rs1800407:G>A in the prediction context. The highest values of the sensitivity (SE = 0.94) and the specificity (SP = 0.90) were obtained for rs12913832, demonstrating the high efficiency of this marker as a classifier of phenotypic groups. The presence of the ancestral dominant allele rs12913832-A causes a dark (brown) iris pigmentation, how- ever, the heterozygous state rs12913832:GA includes a range of mixed variants. The predictive value of rs1800407 for the genetic phenotyping is highly significant (SE = 0.98), but has a low specificity (SP = 0.14), thus rs1800407, not being an effective classifier, can be used as an auxiliary in the eye color predictive model. The analysis of a cumulative impact of the both poly- morphisms on the iris color variation shows their high prospects for the genetic phenotyping of the Belarusian population.ΠΠ΅Π½Π΅ΡΠΈΡΠ΅ΡΠΊΠΎΠ΅ ΡΠ΅Π½ΠΎΡΠΈΠΏΠΈΡΠΎΠ²Π°Π½ΠΈΠ΅ ΡΠ΅Π»ΠΎΠ²Π΅ΠΊΠ° β Π½ΠΎΠ²ΠΎΠ΅, ΠΈΠ½ΡΠ΅Π½ΡΠΈΠ²Π½ΠΎ ΡΠ°Π·Π²ΠΈΠ²Π°ΡΡΠ΅Π΅ΡΡ Π½Π°ΠΏΡΠ°Π²Π»Π΅Π½ΠΈΠ΅ ΠΊΡΠΈΠΌΠΈΠ½Π°Π»ΠΈΡΡΠΈΡΠ΅ΡΠΊΠΎΠΉ Π³Π΅Π½Π΅ΡΠΈΠΊΠΈ. ΠΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠ΅ Π³Π΅Π½Π΅ΡΠΈΡΠ΅ΡΠΊΠΈΡ
ΠΎΡΠ½ΠΎΠ² ΡΠ²Π΅ΡΠΎΠ²ΠΎΠΉ Π²Π°ΡΠΈΠ°ΡΠΈΠΈ Π³Π»Π°Π· ΡΠ²Π»ΡΠ΅ΡΡΡ ΠΎΠ΄Π½ΠΈΠΌ ΠΈΠ· Π½Π°ΠΈΠ±ΠΎΠ»Π΅Π΅ ΠΏΠ΅ΡΡΠΏΠ΅ΠΊΡΠΈΠ²Π½ΡΡ
ΡΡΠ΅Π΄ΠΈ ΠΏΠΎΠ΄Ρ
ΠΎΠ΄ΠΎΠ², Π½Π°ΡΠ΅Π»Π΅Π½Π½ΡΡ
Π½Π° ΡΡΡΠ°Π½ΠΎΠ²Π»Π΅Π½ΠΈΠ΅ ΠΎΠ±Π»ΠΈΠΊΠ° Π½Π΅ΠΈΠ·Π²Π΅ΡΡΠ½ΠΎΠ³ΠΎ ΠΈΠ½Π΄ΠΈΠ²ΠΈΠ΄Π° ΠΏΠΎ Ρ
Π°ΡΠ°ΠΊΡΠ΅ΡΠΈΡΡΠΈΠΊΠ°ΠΌ Π΅Π³ΠΎ ΠΠΠ. Π Π΄Π°Π½Π½ΠΎΠΉ ΡΠ°Π±ΠΎΡΠ΅ ΠΏΡΠΎΠ²Π΅Π΄Π΅Π½ΠΎ ΠΈΠ·ΡΡΠ΅Π½ΠΈΠ΅ ΠΏΠΎΠ»ΠΈΠΌΠΎΡΡΠΈΠ·ΠΌΠΎΠ² rs12913832 ΠΈ rs1800407 Π² Π³Π΅Π½Π°Ρ
HERC2 ΠΈ ΠΠ‘A2 ΡΠΎΠΎΡΠ²Π΅ΡΡΡΠ²Π΅Π½Π½ΠΎ Π² ΡΠ²ΡΠ·ΠΈ Ρ ΠΏΠΈΠ³ΠΌΠ΅Π½ΡΠ°ΡΠΈΠ΅ΠΉ ΡΠ°Π΄ΡΠΆΠΊΠΈ Π³Π»Π°Π· Π±Π΅Π»ΠΎΡΡΡΡΠΊΠΎΠΉ ΠΏΠΎΠΏΡΠ»ΡΡΠΈΠΈ ΠΈ Π΄Π°Π½Π° ΠΎΡΠ΅Π½ΠΊΠ° ΠΈΡ
ΠΏΡΠΎΠ³Π½ΠΎΡΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΡΠ΅ΠΊΡΠΈΠ²Π½ΠΎΡΡΠΈ Π΄Π»Ρ Π³Π΅Π½Π΅ΡΠΈΡΠ΅ΡΠΊΠΎΠ³ΠΎ ΡΠ΅Π½ΠΎΡΠΈΠΏΠΈΡΠΎΠ²Π°Π½ΠΈΡ. ΠΠΎΠ»ΡΡΠ΅Π½Π½ΡΠ΅ Π΄Π°Π½Π½ΡΠ΅ ΠΏΠΎΠ΄ΡΠ²Π΅ΡΠ΄ΠΈΠ»ΠΈ Π·Π½Π°ΡΠΈΠΌΡΠΉ Π²ΠΊΠ»Π°Π΄ Π² ΡΠ²Π΅ΡΠΎΠ²ΡΡ Π²Π°ΡΠΈΠ°ΡΠΈΡ ΡΠ°Π΄ΡΠΆΠΊΠΈ Π³Π»Π°Π· rs12913832:A>G ΠΈ rs1800407:G>A. ΠΡΡΠΎΠΊΠΈΠ΅ Π·Π½Π°ΡΠ΅Π½ΠΈΡ ΡΡΠ²ΡΡΠ²ΠΈΡΠ΅Π»ΡΠ½ΠΎΡΡΠΈ (SE = 0,94) ΠΈ ΡΠΏΠ΅ΡΠΈΡΠΈΡΠ½ΠΎΡΡΠΈ (SP = 0,90) Π±ΡΠ»ΠΈ ΠΏΠΎΠ»ΡΡΠ΅Π½Ρ Π΄Π»Ρ rs12913832, ΠΏΠΎΠ΄ΡΠ²Π΅ΡΠ΄ΠΈΠ² ΡΡΡΠ΅ΠΊΡΠΈΠ²Π½ΠΎΡΡΡ Π΄Π°Π½Π½ΠΎΠ³ΠΎ ΠΌΠ°ΡΠΊΠ΅ΡΠ° Π΄Π»Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΡ Π² ΠΊΠ°ΡΠ΅ΡΡΠ²Π΅ ΠΊΠ»Π°ΡΡΠΈΡΠΈΠΊΠ°ΡΠΎΡΠ° ΡΠ΅Π½ΠΎΡΠΈΠΏΠΈΡΠ΅ΡΠΊΠΈΡ
Π³ΡΡΠΏΠΏ. ΠΠ°Π»ΠΈΡΠΈΠ΅ ΠΏΡΠ΅Π΄ΠΊΠΎΠ²ΠΎΠ³ΠΎ Π΄ΠΎΠΌΠΈΠ½Π°Π½ΡΠ½ΠΎΠ³ΠΎ Π°Π»Π»Π΅Π»Ρ rs12913832-A ΠΎΠ±ΡΡΠ»ΠΎΠ²Π»ΠΈΠ²Π°Π΅Ρ ΡΠ΅ΠΌΠ½ΡΡ ΠΏΠΈΠ³ΠΌΠ΅Π½ΡΠ°ΡΠΈΡ ΡΠ°Π΄ΡΠΆΠΊΠΈ, ΠΎΠ΄Π½Π°ΠΊΠΎ Π³Π΅ΡΠ΅ΡΠΎΠ·ΠΈΠ³ΠΎΡΠ½ΠΎΠ΅ Π½ΠΎΡΠΈΡΠ΅Π»ΡΡΡΠ²ΠΎ rs12913832:GA Π²ΠΊΠ»ΡΡΠ°Π΅Ρ Π·Π½Π°ΡΠΈΡΠ΅Π»ΡΠ½ΡΠΉ ΡΠΏΠ΅ΠΊΡΡ ΡΠΌΠ΅ΡΠ°Π½Π½ΡΡ
Π²Π°ΡΠΈΠ°Π½ΡΠΎΠ². ΠΠ΄Π½ΠΎΠ½ΡΠΊΠ»Π΅ΠΎΡΠΈΠ΄Π½ΡΠΉ ΠΏΠΎΠ»ΠΈΠΌΠΎΡΡΠΈΠ·ΠΌ rs1800407 Ρ
Π°ΡΠ°ΠΊΡΠ΅ΡΠΈΠ·ΡΠ΅ΡΡΡ Π²ΡΡΠΎΠΊΠΎΠΉ ΡΡΠ²ΡΡΠ²ΠΈΡΠ΅Π»ΡΠ½ΠΎΡΡΡΡ (SE = 0,98), ΠΎΠ΄Π½Π°ΠΊΠΎ ΠΈΠΌΠ΅Π΅Ρ Π½ΠΈΠ·ΠΊΠΎΠ΅ Π·Π½Π°ΡΠ΅Π½ΠΈΠ΅ ΡΠΏΠ΅ΡΠΈΡΠΈΡΠ½ΠΎΡΡΠΈ (SP = 0,14), ΡΠ»Π΅Π΄ΠΎΠ²Π°ΡΠ΅Π»ΡΠ½ΠΎ, Π΄Π°Π½Π½ΡΠΉ ΠΌΠ°ΡΠΊΠ΅Ρ, Π½Π΅ ΡΠ²Π»ΡΡΡΡ ΡΡΡΠ΅ΠΊΡΠΈΠ²Π½ΡΠΌ ΠΊΠ»Π°ΡΡΠΈΡΠΈΠΊΠ°ΡΠΎΡΠΎΠΌ, ΠΌΠΎΠΆΠ΅Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°ΡΡΡΡ ΡΠΎΠ»ΡΠΊΠΎ ΠΊΠ°ΠΊ Π²ΡΠΏΠΎΠΌΠΎΠ³Π°ΡΠ΅Π»ΡΠ½ΡΠΉ ΠΈΠ½ΡΡΡΡΠΌΠ΅Π½Ρ Π΄Π»Ρ ΠΏΡΠ΅Π΄ΡΠΊΠ°Π·Π°Π½ΠΈΡ ΡΠ²Π΅ΡΠ° Π³Π»Π°Π·. ΠΡΠ΅Π½ΠΊΠ° ΡΠΎΠ²ΠΎΠΊΡΠΏΠ½ΠΎΠ³ΠΎ Π²ΠΊΠ»Π°Π΄Π° ΠΈΠ·ΡΡΠ΅Π½Π½ΡΡ
ΠΏΠΎΠ»ΠΈΠΌΠΎΡΡΠΈΠ·ΠΌΠΎΠ² Π² ΡΠ²Π΅ΡΠΎΠ²ΡΡ Π²Π°ΡΠΈΠ°ΡΠΈΡ ΡΠ°Π΄ΡΠΆΠΊΠΈ Π³Π»Π°Π· Π±Π΅Π»ΠΎΡΡΡΡΠΊΠΎΠΉ ΠΏΠΎΠΏΡΠ»ΡΡΠΈΠΈ ΠΏΠΎΠΊΠ°Π·ΡΠ²Π°Π΅Ρ ΠΈΡ
Π²ΡΡΠΎΠΊΠΈΠΉ ΠΏΡΠΎΠ³Π½ΠΎΡΡΠΈΡΠ΅ΡΠΊΠΈΠΉ ΠΏΠΎΡΠ΅Π½ΡΠΈΠ°Π» Π΄Π»Ρ Π³Π΅Π½Π΅ΡΠΈΡΠ΅ΡΠΊΠΎΠ³ΠΎ ΡΠ΅Π½ΠΎΡΠΈΠΏΠΈΡΠΎΠ²Π°Π½ΠΈΡ
Usefulness of the SF-36 Health Survey questionnaire in screening for health-related quality of life among parents of children with cancer: Latent profile analysis
Background: Poor health-related quality of life (HRQOL) of parents of children with cancer as their main caregivers can adversely affect childβs HRQOL. Short Form-36 Health Survey (SF-36) is a widely used instrument to measure HRQOL. However, there are no clearly defined cut-off points for screening for parents with poor HRQOL. This study aimed to find appropriate cut-off points for the SF-36 questionnaire in a sample of parents of children with cancer using latent profile analysis to add another possibility to use it.
Methods: In this cross-sectional study, 110 couples (110 mothers and 110 fathers) of children diagnosed with cancer selected by simple random sampling method from the patients' files were included. The study was conducted at two settings, pediatric hematology ward and pediatric hematology clinic of a university hospital in 2016-2017. Latent Profile analysis method was used to determine appropriate cut-off points for the SF-36 questionnaire. Data was analyzed by Mplus and R3.3.0 software.
Results:
Based on the results, scores β€44, 45-63 and β₯64 for mental health, and scores β€43, 44-59 and β₯60 for physical health classes indicate weak, medium, and good, respectively. These cut-off points showed acceptable accuracy in classification of individuals. For the total quality of life, correct classification rates were 88%, 65% and 53% for each class respectively. For mental health (physical health), they were 79 (63), 50 (62) and 52 (63) for each class respectively.
Conclusion: The cut-off points for the classes identified here can be useful in screening parents of children with cancer in clinical setting to provide clinical interventions to protect vulnerable parents from negative outcomes
Quantifying Explainability of Saliency Methods in Deep Neural Networks
One way to achieve eXplainable artificial intelligence (XAI) is through the
use of post-hoc analysis methods. In particular, methods that generate heatmaps
have been used to explain black-box models, such as deep neural network. In
some cases, heatmaps are appealing due to the intuitive and visual ways to
understand them. However, quantitative analysis that demonstrates the actual
potential of heatmaps have been lacking, and comparison between different
methods are not standardized as well. In this paper, we introduce a synthetic
dataset that can be generated adhoc along with the ground-truth heatmaps for
better quantitative assessment. Each sample data is an image of a cell with
easily distinguishable features, facilitating a more transparent assessment of
different XAI methods. Comparison and recommendations are made, shortcomings
are clarified along with suggestions for future research directions to handle
the finer details of select post-hoc analysis methods
DiagTest3Grp: An R Package for Analyzing Diagnostic Tests with Three Ordinal Groups
Medical researchers endeavor to identify potentially useful biomarkers to develop markerbased screening assays for disease diagnosis and prevention. Useful summary measures which properly evaluate the discriminative ability of diagnostic markers are critical for this purpose. Literature and existing software, for example, R packages nicely cover summary measures for diagnostic markers used for the binary case (e.g., healthy vs. diseased). An intermediate population at an early disease stage usually exists between the healthy and the fully diseased population in many disease processes. Supporting utilities for threegroup diagnostic tests are highly desired and important for identifying patients at the early disease stage for timely treatments. However, application packages which provide summary measures for three ordinal groups are currently lacking. This paper focuses on two summary measures of diagnostic accuracyβvolume under the receiver operating characteristic surface and the extended Youden index, with three diagnostic groups. We provide the R package DiagTest3Grp to estimate, under both parametric and nonparametric assumptions, the two summary measures and the associated variances, as well as the optimal cut-points for disease diagnosis. An omnibus test for multiple markers and a Wald test for two markers, on independent or paired samples, are incorporated to compare diagnostic accuracy across biomarkers. Sample size calculation under the normality assumption can be performed in the R package to design future diagnostic studies. A real world application evaluating the diagnostic accuracy of neuropsychological markers for Alzheimerβs disease is used to guide readers through step-by-step implementation of DiagTest3Grp to demonstrate its utility
Multi-objective optimisation for receiver operating characteristic analysis
Copyright Β© 2006 Springer-Verlag Berlin Heidelberg. The final publication is available at link.springer.comBook title: Multi-Objective Machine LearningSummary
Receiver operating characteristic (ROC) analysis is now a standard tool for the comparison of binary classifiers and the selection operating parameters when the costs of misclassification are unknown.
This chapter outlines the use of evolutionary multi-objective optimisation techniques for ROC analysis, in both its traditional binary classification setting, and in the novel multi-class ROC situation.
Methods for comparing classifier performance in the multi-class case, based on an analogue of the Gini coefficient, are described, which leads to a natural method of selecting the classifier operating point. Illustrations are given concerning synthetic data and an application to Short Term Conflict Alert