47 research outputs found

    Presenting a Labelled Dataset for Real-Time Detection of Abusive User Posts

    Social media sites facilitate users in posting their own personal comments online. Most support free format user posting, with close to real-time publishing speeds. However, online posts generated by a public user audience carry the risk of containing inappropriate, potentially abusive content. To detect such content, the straightforward approach is to filter against blacklists of profane terms. However, this lexicon filtering approach is prone to problems around word variations and lack of context. Although recent methods inspired by machine learning have boosted detection accuracies, the lack of gold standard labelled datasets limits the development of this approach. In this work, we present a dataset of user comments, using crowdsourcing for labelling. Since abusive content can be ambiguous and subjective to the individual reader, we propose an aggregated mechanism for assessing different opinions from different labellers. In addition, instead of the typical binary categories of abusive or not, we introduce a third class of ‘undecided’ to capture the real life scenario of instances that are neither blatantly abusive nor clearly harmless. We have performed preliminary experiments on this dataset using best practice techniques in text classification. Finally, we have evaluated the detection performance of various feature groups, namely syntactic, semantic and context-based features. Results show these features can increase our classifier performance by 18% in detection of abusive content.

    Oncogenic gene expression and epigenetic remodeling of cis-regulatory elements in ASXL1-mutant chronic myelomonocytic leukemia

    Myeloid neoplasms are clonal hematopoietic stem cell disorders driven by the sequential acquisition of recurrent genetic lesions. Truncating mutations in the chromatin remodeler ASXL1 (ASXL1MT) are associated with a high-risk disease phenotype with increased proliferation, epigenetic therapeutic resistance, and poor survival outcomes. We performed a multi-omics interrogation to define gene expression and chromatin remodeling associated with ASXL1MT in chronic myelomonocytic leukemia (CMML). ASXL1MT are associated with a loss of repressive histone methylation and an increase in permissive histone methylation and acetylation in promoter regions. ASXL1MT are further associated with de novo accessibility of distal enhancers binding ETS transcription factors, targeting important leukemogenic driver genes. Chromatin remodeling of promoters and enhancers is strongly associated with gene expression and heterogeneous among overexpressed genes. These results provide a comprehensive map of the transcriptome and chromatin landscape of ASXL1MT CMML, forming an important framework for the development of novel therapeutic strategies targeting oncogenic cis interactions.

    Prognostic relevance of clonal hematopoiesis in myeloid neoplastic transformation in patients with follicular lymphoma treated with radioimmunotherapy

    While novel radioisotope therapies continue to advance cancer care, reports of therapy-related myeloid neoplasms (t-MN) have generated concern. The prevalence and role of clonal hematopoiesis (CH) in this process remain to be defined. We hypothesized that: (i) CH is prevalent in relapsed follicular lymphoma and is associated with t-MN transformation, and (ii) radiation in the form of radioimmunotherapy (RIT) plays a role in clonal progression. In this retrospective cohort study, we evaluated the prevalence and prognostic impact of CH on clinical outcomes in 58 heavily pre-treated follicular lymphoma patients who received RIT. Patients had been given a median of four lines of therapy before RIT. The prevalence of CH prior to RIT was 46%, while it was 67% (P=0.15) during the course of RIT and subsequent therapies in the paired samples. Fourteen (24%) patients developed t-MN. Patients with t-MN had a higher variant allele fraction (38% vs. 15%; P=0.02) and clonal complexity (P=0.03) than those without. The spectrum of CH differed from that in age-related CH, with a high prevalence of DNA damage repair and response pathway mutations, absence of spliceosome mutations, and a paucity of signaling mutations. While there were no clear clinical associations between RIT and t-MN, or overall survival, patients with t-MN had a higher mutant clonal burden, along with extensive chromosomal abnormalities (median survival, after t-MN diagnosis, 0.9 months). The baseline prevalence of CH was high, with an increase in prevalence on exposure to RIT and subsequent therapies. The high rates of t-MN with marked clonal complexities and extensive chromosomal damage underscore the importance of better identifying and studying genotoxic stressors accentuated by therapeutic modalities.

    Prognostic impact of SF3B1 mutation and multilineage dysplasia in myelodysplastic syndromes with ring sideroblasts: a Mayo Clinic study of 170 informative cases

    The revised 4th edition of the World Health Organization (WHO4R) classification lists myelodysplastic syndromes with ring sideroblasts (MDS-RS) as a separate entity with single lineage (MDS-RS-SLD) or multilineage (MDS-RS-MLD) dysplasia. The more recent International Consensus Classification (ICC) distinguishes between MDS with SF3B1 mutation (MDS-SF3B1) and MDS-RS without SF3B1 mutation; the latter is instead included under the category of MDS not otherwise specified. The current study includes 170 Mayo Clinic patients with WHO4R-defined MDS-RS, including MDS-RS-SLD (N=83) and MDS-RS-MLD (N=87); a subset of 145 patients were also evaluable for the presence of SF3B1 and other mutations, including 126 (87%) with and 19 (13%) without SF3B1 mutation. Median overall survival for all 170 patients was 6.6 years with 5- and 10-year survival rates of 59% and 25%, respectively. A significant difference in overall survival was apparent between MDS-RS-MLD and MDS-RS-SLD (p<0.01) but not between MDS-RS with and without SF3B1 mutation (p=0.36). Multivariable analysis confirmed the independent prognostic contribution of MLD (HR 1.8, 95% CI 1.1-2.8; p=0.01) and also identified age (p<0.01), transfusion need at diagnosis (p<0.01), and abnormal karyotype (p<0.01) as additional risk factors; the impact from SF3B1 or other mutations was not significant. Leukemia-free survival was independently affected by abnormal karyotype (p<0.01), RUNX1 (p=0.02) and IDH1 (p=0.01) mutations, but not by MLD or SF3B1 mutation. Exclusion of patients not meeting ICC criteria for MDS-SF3B1 did not change the observations on overall survival. MLD-based, as opposed to SF3B1 mutation-based, disease classification for MDS-RS might be prognostically more relevant.

    Clinical and molecular correlates of somatic and germline DDX41 variants in patients and families with myeloid neoplasms

    The diagnosis of germline predisposition to myeloid neoplasms (MN) secondary to DDX41 variants is currently hindered by the long latency period, variable family histories and the frequent occurrence of DDX41 variants of uncertain significance (VUS). We reviewed 4,524 consecutive patients who underwent targeted sequencing for suspected or known MN and analyzed the clinical impact and relevance of DDX41VUS in comparison to DDX41path variants. Among 107 patients (44 [0.9%] DDX41path and 63 DDX41VUS [1.4%; 11 patients with both DDX41path and DDX41VUS]), we identified 17 unique DDX41path and 45 DDX41VUS variants: 24 (23%) and 77 (72%) patients had proven and presumed germline DDX41 variants, respectively. The median age was similar between DDX41path and DDX41VUS (66 vs. 62 years; P=0.41). The median variant allele frequency (VAF) (47% vs. 48%; P=0.62), frequency of somatic myeloid co-mutations (34% vs. 25%; P=0.28), cytogenetic abnormalities (16% vs. 12%; P>0.99) and family history of hematological malignancies (20% vs. 33%; P=0.59) were comparable between the two groups. Time to treatment in months (1.53 vs. 0.3; P=0.16) and proportion of patients progressing to acute myeloid leukemia (14% vs. 11%; P=0.68) were similar. The median overall survival in patients with high-risk myelodysplastic syndrome/acute myeloid leukemia was 63.4 and 55.7 months in the context of DDX41path and DDX41VUS, respectively (P=0.93). Comparable molecular profiles and clinical outcomes among DDX41path and DDX41VUS patients highlight the need for a comprehensive DDX41 variant interrogation/classification system, to improve surveillance and management strategies in patients and families with germline DDX41 predisposition syndromes.