23 research outputs found
Large expert-curated database for benchmarking document similarity detection in biomedical literature search
Document recommendation systems for locating relevant literature have mostly relied on methods developed a decade ago. This is largely due to the lack of a large offline gold-standard benchmark of relevant documents that cover a variety of research fields such that newly developed literature search techniques can be compared, improved and translated into practice. To overcome this bottleneck, we have established the RElevant LIterature SearcH consortium consisting of more than 1500 scientists from 84 countries, who have collectively annotated the relevance of over 180 000 PubMed-listed articles with regard to their respective seed (input) article/s. The majority of annotations were contributed by highly experienced, original authors of the seed articles. The collected data cover 76% of all unique PubMed Medical Subject Headings descriptors. No systematic biases were observed across different experience levels, research fields or time spent on annotations. More importantly, annotations of the same document pairs contributed by different scientists were highly concordant. We further show that the three representative baseline methods used to generate recommended articles for evaluation (Okapi Best Matching 25, Term Frequency-Inverse Document Frequency and PubMed Related Articles) had similar overall performances. Additionally, we found that these methods each tend to produce distinct collections of recommended articles, suggesting that a hybrid method may be required to completely capture all relevant articles. The established database server located at https://relishdb.ict.griffith.edu.au is freely available for the downloading of annotation data and the blind testing of new methods. We expect that this benchmark will be useful for stimulating the development of new powerful techniques for title and title/abstract-based search engines for relevant articles in biomedical research.Peer reviewe
Human and machine validation of 14 databases of dynamic facial expressions
With a shift in interest toward dynamic expressions, numerous corpora of dynamic facial stimuli have been developed over the past two decades. The present research aimed to test existing sets of dynamic facial expressions (published between 2000 and 2015) in a cross-corpus validation effort. For this, 14 dynamic databases were selected that featured facial expressions of the basic six emotions (anger, disgust, fear, happiness, sadness, surprise) in posed or spontaneous form. In Study 1, a subset of stimuli from each database (N = 162) were presented to human observers and machine analysis, yielding considerable variance in emotion recognition performance across the databases. Classification accuracy further varied with perceived intensity and naturalness of the displays, with posed expressions being judged more accurately and as intense, but less natural compared to spontaneous ones. Study 2 aimed for a full validation of the 14 databases by subjecting the entire stimulus set (N = 3812) to machine analysis. A FACS-based Action Unit (AU) analysis revealed that facial AU configurations were more prototypical in posed than spontaneous expressions. The prototypicality of an expression in turn predicted emotion classification accuracy, with higher performance observed for more prototypical facial behavior. Furthermore, technical features of each database (i.e., duration, face box size, head rotation, and motion) had a significant impact on recognition accuracy. Together, the findings suggest that existing databases vary in their ability to signal specific emotions, thereby facing a trade-off between realism and ecological validity on the one end, and expression uniformity and comparability on the other
Sex differences in facial emotion recognition across varying expression intensity levels from videos
Recommended from our members
Estimating the heritability of reporting stressful life events captured by common genetic variants
BackgroundAlthough usually thought of as external environmental stressors, a significant heritable component has been reported for measures of stressful life events (SLEs) in twin studies. Method We examined the variance in SLEs captured by common genetic variants from a genome-wide association study (GWAS) of 2578 individuals. Genome-wide complex trait analysis (GCTA) was used to estimate the phenotypic variance tagged by single nucleotide polymorphisms (SNPs). We also performed a GWAS on the number of SLEs, and looked at correlations between siblings.ResultsA significant proportion of variance in SLEs was captured by SNPs (30%, p = 0.04). When events were divided into those considered to be dependent or independent, an equal amount of variance was explained for both. This 'heritability' was in part confounded by personality measures of neuroticism and psychoticism. A GWAS for the total number of SLEs revealed one SNP that reached genome-wide significance (p = 4 × 10-8), although this association was not replicated in separate samples. Using available sibling data for 744 individuals, we also found a significant positive correlation of R 2 = 0.08 in SLEs (p = 0.03).ConclusionsThese results provide independent validation from molecular data for the heritability of reporting environmental measures, and show that this heritability is in part due to both common variants and the confounding effect of personality.R. A. Power... S. Cohen-Woods... et al