12 research outputs found

    Capisco: Semantic Analysis of Documents from the HathiTrust Corpus

    Get PDF
    The Capisco project developed a suite of tools that analyze documents by the semantics of their content and metadata. Clustering documents by semantic similarity opens a wealth of opportunities for scholarly research.The project was designed in close collaboration with two humanities scholars, from the areas of Maori & Pacific Studies and Historical Anthropology, who provided ongoing input and feedback during the development process. This report was submitted to the Workset Creation for Scholarly Analysis: Prototyping Project, funded by the Andrew W. Mellon Foundation.Andrew W. Mellon Foundation, Grant Reference No. 21300666Ope

    Prognostic impact of CEBPA mutational subgroups in adult AML

    Get PDF
    Despite recent refinements in the diagnostic and prognostic assessment of CEBPA mutations in AML, several questions remain open, i.e. implications of different types of basic region leucin zipper (bZIP) mutations, the role of co-mutations and the allelic state. Using pooled primary data analysis on 1010 CEBPA-mutant adult AML patients, a comparison was performed taking into account the type of mutation (bZIP: either typical in-frame insertion/deletion (InDel) mutations (bZIPInDel), frameshift InDel or nonsense mutations inducing translational stop (bZIPSTOP) or single base-pair missense alterations (bZIPms), and transcription activation domain (TAD) mutations) and the allelic state (single (smCEBPA) vs. double mutant (dmCEBPA)). Only bZIPInDel patients had significantly higher rates of complete remission and longer relapse free and overall survival (OS) compared with all other CEBPA-mutant subgroups. Moreover, co-mutations in bZIPInDel patients (e.g. GATA2, FLT3, WT1 as well as ELN2022 adverse risk aberrations) had no independent impact on OS, whereas in non-bZIPInDel patients, grouping according to ELN2022 recommendations added significant prognostic information. In conclusion, these results demonstrate bZIPInDel mutations to be the major independent determinant of outcome in CEBPA-mutant AML, thereby refining current classifications according to WHO (including all dmCEBPA and smCEBPA bZIP) as well as ELN2022 and ICC recommendations (including CEBPA bZIPms)

    Seeding strategies for semantic disambiguation

    Get PDF
    Semantic disambiguation determines the meaning of words and phrases in a text, for which we use an automatically-generated Concept-in-Context (CiC) network. Words and phrases rarely belong to a single concept; disambiguation in Capisco relies on interplay between words that are in close vicinity in the text. Starting the disambiguation is a seeding process, that identifies the first concepts, which then form the context for further disambiguation steps. This paper introduces the seeding algorithm and explores seeding strategies for identifying these initial concepts in text volumes, such as books, that are stored in a digital library

    LIONS PREY: A New Logistic Scoring System for the Prediction of Malignant Pulmonary Nodules

    No full text
    Objectives: Classifying radiologic pulmonary lesions as malignant is challenging. Scoring systems like the Mayo model lack precision in predicting the probability of malignancy. We developed the logistic scoring system ‘LIONS PREY’ (Lung lesION Score PREdicts malignancY), which is superior to existing models in its precision in determining the likelihood of malignancy. Methods: We evaluated all patients that were presented to our multidisciplinary team between January 2013 and December 2020. Availability of pathological results after resection or CT-/EBUS-guided sampling was mandatory for study inclusion. Two groups were formed: Group A (malignant nodule; n = 238) and Group B (benign nodule; n = 148). Initially, 22 potential score parameters were derived from the patients’ medical histories. Results: After uni- and multivariate analysis, we identified the following eight parameters that were integrated into a scoring system: (1) age (Group A: 64.5 ± 10.2 years vs. Group B: 61.6 ± 13.8 years; multivariate p-value: 0.054); (2) nodule size (21.8 ± 7.5 mm vs. 18.3 ± 7.9 mm; p = 0.051); (3) spiculation (73.1% vs. 41.9%; p = 0.024); (4) solidity (84.9% vs. 62.8%; p = 0.004); (5) size dynamics (6.4 ± 7.7 mm/3 months vs. 0.2 ± 0.9 mm/3 months; p p p = 0.079); and (8) cancer history (34.9% vs. 24.3%; p = 0.052). Our model demonstrated superior precision to that of the Mayo score (p = 0.013) with an overall correct classification of 96.0%, a calibration (observed/expected-ratio) of 1.1, and a discrimination (ROC analysis) of AUC (95% CI) 0.94 (0.92–0.97). Conclusions: Focusing on essential parameters, LIONS PREY can be easily and reproducibly applied based on computed tomography (CT) scans. Multidisciplinary team members could use it to facilitate decision making. Patients may find it easier to consent to surgery knowing the likelihood of pulmonary malignancy. The LIONS PREY app is available for free on Android and iOS devices
    corecore