63 research outputs found

    On the Complexity of Query Result Diversification

    Get PDF
    Query result diversification is a bi-criteria optimization problem for ranking query results. Given a database D, a query Q and a positive integer k, it is to find a set of k tuples from Q(D) such that the tuples are as relevant as possible to the query, and at the same time, as diverse as possible to each other. Subsets of Q(D) are ranked by an objective function defined in terms of relevance and diversity. Query result diversification has found a variety of applications in databases, information retrieval and operations research. This paper studies the complexity of result diversification for relational queries. We identify three problems in connection with query result diversification, to determine whether there exists a set of k tuples that is ranked above a bound with respect to relevance and diversity, to assess the rank of a given k-element set, and to count how many k-element sets are ranked above a given bound. We study these problems for a variety of query languages and for three objective functions. We establish the upper and lower bounds of these problems, all matching, for both combined complexity and data complexity. We also investigate several special settings of these problems, identifying tractable cases. 1

    Clinical, genetic, epidemiologic, evolutionary, and functional delineation of TSPEAR-related autosomal recessive ectodermal dysplasia 14

    Get PDF
    TSPEAR variants cause autosomal recessive ectodermal dysplasia (ARED) 14. The function of TSPEAR is unknown. The clinical features, the mutation spectrum, and the underlying mechanisms of ARED14 are poorly understood. Combining data from new and previously published individuals established that ARED14 is primarily characterized by dental anomalies such as conical tooth cusps and hypodontia, like those seen in individuals with WNT10A-related odontoonychodermal dysplasia. AlphaFold-predicted structure-based analysis showed that most of the pathogenic TSPEAR missense variants likely destabilize the β-propeller of the protein. Analysis of 100000 Genomes Project (100KGP) data revealed multiple founder TSPEAR variants across different populations. Mutational and recombination clock analyses demonstrated that non-Finnish European founder variants likely originated around the end of the last ice age, a period of major climatic transition. Analysis of gnomAD data showed that the non-Finnish European population TSPEAR gene-carrier rate is ∼1/140, making it one of the commonest AREDs. Phylogenetic and AlphaFold structural analyses showed that TSPEAR is an ortholog of drosophila Closca, an extracellular matrix-dependent signaling regulator. We, therefore, hypothesized that TSPEAR could have a role in enamel knot, a structure that coordinates patterning of developing tooth cusps. Analysis of mouse single-cell RNA sequencing (scRNA-seq) data revealed highly restricted expression of Tspear in clusters representing enamel knots. A tspeara−/−;tspearb−/− double-knockout zebrafish model recapitulated the clinical features of ARED14 and fin regeneration abnormalities of wnt10a knockout fish, thus suggesting interaction between tspear and wnt10a. In summary, we provide insights into the role of TSPEAR in ectodermal development and the evolutionary history, epidemiology, mechanisms, and consequences of its loss of function variants

    Construction of orthogonal and nearly orthogonal designs for computer experiments

    No full text
    This paper presents new infinite families of orthogonal designs for computer experiments. In cases where orthogonal designs cannot exist, we construct alternative, nearly orthogonal designs. Our designs can accommodate many factors and a large set of levels. No iterative computer search is required. To build up the desired orthogonal designs we develop and use new infinite classes of periodic Golay pairs

    Population characteristics of the Limpet Patella caerulea (Linnaeus, 1758) in Eastern Mediterranean (Central Greece)

    No full text
    Limpets are pivotal for structuring and regulating the ecological balance of littoral communities and arewidely collected for human consumption and as fishing bait. Limpets of the species Patella caerulea were collected between April 2016 and April 2017 from two sites, and two samplings per each site with varying degree of exposure to wave action and anthropogenic pressure, in Eastern Mediterranean (Pagasitikos Gulf, Central Greece). This study addresses a knowledge gap on population characteristics of P. caerulea populations in Eastern Mediterranean, assesses population structure, allometric relationships, and reproductive status. Morphometric characteristics exhibited spatio-temporal variation. Population density was significantly higher at the exposed site. Spatial relationship between members of thepopulation exhibitedclumpedpattern ofdispersionduring spring. Broadcast spawning of the population occurred during summer. Seven dominant age groupswere identified, with the dominant cohort in the third-year class. Significant negative allometric relationships were exhibited between morphometric characteristics. Differences in growth patterns among populations were indicated. © 2020 by the authors

    Unobtrusive Behavioral and Activity-Related Multimodal Biometrics: The ACTIBIO Authentication Concept

    Get PDF
    Unobtrusive Authentication Using ACTIvity-Related and Soft BIOmetrics (ACTIBIO) is an EU Specific Targeted Research Project (STREP) where new types of biometrics are combined with state-of-the-art unobtrusive technologies in order to enhance security in a wide spectrum of applications. The project aims to develop a modular, robust, multimodal biometrics security authentication and monitoring system, which uses a biodynamic physiological profile, unique for each individual, and advancements of the state of the art in unobtrusive behavioral and other biometrics, such as face, gait recognition, and seat-based anthropometrics. Several shortcomings of existing biometric recognition systems are addressed within this project, which have helped in improving existing sensors, in developing new algorithms, and in designing applications, towards creating new, unobtrusive, biometric authentication procedures in security-sensitive, Ambient Intelligence environments. This paper presents the concept of the ACTIBIO project and describes its unobtrusive authentication demonstrator in a real scenario by focusing on the vision-based biometric recognition modalities

    Support vector machines classification on class imbalanced data: a case study with real medical data

    No full text
    Support vector machines (SVMs) constitute one of the most popular and powerful classification methods. However, SVMs can be limited in their performance on highly imbalanced datasets. A classifier which has been trained on an imbalanced dataset can produce a biased model towards the majority class and result in high misclassification rate for minority class. For many applications, especially for medical diagnosis, it is of high importance to accurately distinguish false negative from false positive results. The purpose of this study is to successfully evaluate the performance of a classifier, keeping the correct balance between sensitivity and specificity, in order to enable the success of trauma outcome prediction. We compare the standard (or classic) SVM (C SVM) with resampling methods and a cost sensitive method, called Two Cost SVM (TC SVM), which constitute widely accepted strategies for imbalanced datasets and the derived results were discussed in terms of the sensitivity analysis and receiver operating characteristic (ROC) curves
    corecore