14 research outputs found

    Top-k String Auto-Completion with Synonyms

    Auto-completion is one of the most prominent features of modern information systems. Existing auto-completion solutions suggest completions based on the beginning of the character sequence typed so far (i.e., the prefix). However, in many real applications, an entity often has synonyms or abbreviations. For example, "DBMS" is an abbreviation of "Database Management Systems". In this paper, we study a novel type of auto-completion that uses synonyms and abbreviations. We propose three trie-based algorithms for top-k auto-completion with synonyms, each with a different trade-off between space and time complexity. Experiments on large-scale datasets show that it is possible to support effective and efficient synonym-based retrieval of completions over a million strings with thousands of synonym rules at about a microsecond per completion, with a small space overhead (160-200 bytes per string). Peer reviewed
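    As a rough illustration of the idea in this abstract, the sketch below indexes each string in a trie together with variants produced by synonym rules, so that typing "db" can still surface "Database Management Systems". It is a naive expansion baseline written for illustration only, not one of the three algorithms the paper proposes; the class names, the rule format (full form mapped to alternative form), and the scores are all assumptions.

```python
import heapq


class TrieNode:
    def __init__(self):
        self.children = {}
        self.targets = set()  # (score, original string) pairs ending at this node


class SynonymTrie:
    """Hypothetical baseline: expand synonym rules at insertion time."""

    def __init__(self, rules):
        # rules maps a full form to an alternative form (assumed format),
        # e.g. {"Database Management Systems": "DBMS"}.
        self.root = TrieNode()
        self.rules = rules

    def _variants(self, s):
        # The string itself plus one variant per applicable rule.
        variants = {s}
        for full, alt in self.rules.items():
            if full in s:
                variants.add(s.replace(full, alt))
        return variants

    def insert(self, s, score):
        # Every variant leads back to the original string s.
        for v in self._variants(s):
            node = self.root
            for ch in v.lower():
                node = node.children.setdefault(ch, TrieNode())
            node.targets.add((score, s))

    def complete(self, prefix, k):
        # Walk down to the node for the prefix, if it exists.
        node = self.root
        for ch in prefix.lower():
            node = node.children.get(ch)
            if node is None:
                return []
        # Collect every string reachable below that node.
        found = set()
        stack = [node]
        while stack:
            n = stack.pop()
            found |= n.targets
            stack.extend(n.children.values())
        # Keep the k highest-scoring originals.
        return [s for _, s in heapq.nlargest(k, found)]


trie = SynonymTrie({"Database Management Systems": "DBMS"})
trie.insert("Database Management Systems", 10)
trie.insert("Database Design", 7)
trie.insert("Data Mining", 9)
print(trie.complete("db", 2))
print(trie.complete("data", 2))
```

    Expanding every rule at insertion time keeps lookups simple but multiplies the number of indexed strings, which is the kind of space and time trade-off the abstract refers to.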

    Benefits of biomarker selection and clinico-pathological covariate inclusion in breast cancer prognostic models

    Introduction: Multi-marker molecular assays have impacted management of early stage breast cancer, facilitating adjuvant chemotherapy decisions. We generated prognostic models that incorporate protein-based molecular markers and clinico-pathological variables to improve survival prediction. Methods: We used a quantitative immunofluorescence method to study protein expression of 14 markers included in the Oncotype DX™ assay in a cohort of 638 breast cancer patients with 15-year follow-up. We performed cross-validation analyses to assess the performance of multivariate Cox models consisting of these markers and standard clinico-pathological covariates, using the average time-dependent area under the receiver operating characteristic curve, and compared it to that of nested Cox models obtained by robust backward selection procedures. Results: A prognostic index derived from a multivariate Cox regression model incorporating molecular and clinico-pathological covariates (nodal status, tumor size, nuclear grade, and age) is superior to models based on molecular studies alone or clinico-pathological covariates alone. Performance of this composite model can be further improved using feature selection techniques to prune variables. When stratifying patients by Nottingham Prognostic Index (NPI), the most prognostic markers in high and low NPI groups differed. Similarly, for the node-negative, hormone receptor-positive sub-population, we derived a compact model with three clinico-pathological variables and two protein markers that was superior to the full model. Conclusions: Prognostic models that include both molecular and clinico-pathological covariates can be more accurate than models based on either set of features alone. Furthermore, feature selection can decrease the number of molecular variables needed to predict outcome, potentially resulting in less expensive assays. This work was supported by a grant from the Susan G Komen Foundation (to YK)

    Multi-level police patrol planning


    Optimizing Meta-heuristics for the Time-Dependent TSP Applied to Air Travels

    A travel agency has recently proposed the Traveling Salesman Challenge (TSC), a problem that consists of finding the best flights to visit a set of cities at the lowest cost. Our approach to this challenge consists of using a meta-optimized Ant Colony Optimization (ACO) strategy which, at the end of each iteration, generates a new ant by running Simulated Annealing on, or applying a mutation operator to, the best ant of the iteration. Results are compared to variations of this algorithm, as well as to other meta-heuristic methods. They show that the developed approach is a better alternative than regular ACO for the time-dependent TSP class of problems, and that applying a K-Opt optimization will usually improve the results. (c) 2018, Springer Nature Switzerland AG
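    The refinement step mentioned in this abstract (improving the best tour of an iteration through mutation or Simulated Annealing) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the ACO part is omitted entirely, and the cost function signature `cost(i, j, day)`, the swap mutation, the one-flight-per-day model, and the cooling parameters are all hypothetical choices.

```python
import math
import random


def tour_cost(tour, cost):
    # Assumed model: one flight per day; the flight taken on day d from
    # tour[d] to the next city costs cost(i, j, d). The last flight
    # returns to the start city.
    total = 0.0
    for day in range(len(tour)):
        total += cost(tour[day], tour[(day + 1) % len(tour)], day)
    return total


def swap_mutation(tour, rng):
    # Swap two cities, keeping the start city (position 0) fixed.
    i, j = rng.sample(range(1, len(tour)), 2)
    new = list(tour)
    new[i], new[j] = new[j], new[i]
    return new


def anneal(tour, cost, rng, steps=2000, t0=10.0, alpha=0.995):
    # Plain simulated annealing with geometric cooling (assumed schedule).
    current, current_cost = list(tour), tour_cost(tour, cost)
    best, best_cost = current, current_cost
    temp = t0
    for _ in range(steps):
        cand = swap_mutation(current, rng)
        cand_cost = tour_cost(cand, cost)
        delta = cand_cost - current_cost
        # Always accept improvements; accept worse tours with a
        # probability that shrinks as the temperature drops.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current, current_cost = cand, cand_cost
            if current_cost < best_cost:
                best, best_cost = current, current_cost
        temp *= alpha
    return best, best_cost


def toy_cost(i, j, day):
    # Made-up time-dependent price: distance scaled by a day-of-trip factor.
    return abs(i - j) * (1 + day % 3)


start = [0, 3, 1, 4, 2, 5]
best, best_cost = anneal(start, toy_cost, random.Random(0))
print(best, best_cost)
```

    Because the price of a flight depends on the day it is taken, swapping two cities changes the cost of every affected leg, so the sketch recomputes the full tour cost for each candidate instead of using an incremental delta.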