
    Automatic Detection of Cyberbullying in Social Media Text

    While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online. Recent studies report that cyberbullying constitutes a growing problem among youngsters. Successful prevention depends on the adequate detection of potentially harmful messages and the information overload on the Web requires intelligent systems to identify potential risks automatically. The focus of this paper is on automatic cyberbullying detection in social media text by modelling posts written by bullies, victims, and bystanders of online bullying. We describe the collection and fine-grained annotation of a training corpus for English and Dutch and perform a series of binary classification experiments to determine the feasibility of automatic cyberbullying detection. We make use of linear support vector machines exploiting a rich feature set and investigate which information sources contribute the most for this particular task. Experiments on a holdout test set reveal promising results for the detection of cyberbullying-related posts. After optimisation of the hyperparameters, the classifier yields an F1-score of 64% and 61% for English and Dutch respectively, and considerably outperforms baseline systems based on keywords and word unigrams. Comment: 21 pages, 9 tables, under review

    Sudden cardiac death due to deficiency of the mitochondrial inorganic pyrophosphatase PPA2

    We have used whole exome sequencing to identify biallelic missense mutations in the nuclear-encoded mitochondrial inorganic pyrophosphatase (PPA2) in ten individuals from four unrelated pedigrees that are associated with mitochondrial disease. These individuals show a range of severity, indicating that PPA2 mutations may cause a spectrum of mitochondrial disease phenotypes. Severe symptoms include seizures, lactic acidosis and cardiac arrhythmia and death within days of birth. In the index family, presentation was milder and manifested as cardiac fibrosis and an exquisite sensitivity to alcohol, leading to sudden arrhythmic cardiac death in the second decade of life. Comparison of normal and mutated PPA2 containing mitochondria from fibroblasts showed the activity of inorganic pyrophosphatase significantly reduced in affected individuals. Recombinant PPA2 enzymes modeling hypomorphic missense mutations had decreased activity that correlated with disease severity. These findings confirm the pathogenicity of PPA2 mutations, and suggest that PPA2 is a new cardiomyopathy-associated protein, which has a greater physiological importance in mitochondrial function than previously recognized

    Convergence in the temperature response of leaf respiration across biomes and plant functional types

    Plant respiration constitutes a massive carbon flux to the atmosphere, and a major control on the evolution of the global carbon cycle. It therefore has the potential to modulate levels of climate change due to the human burning of fossil fuels. Neither current physiological nor terrestrial biosphere models adequately describe its short-term temperature response, and even minor differences in the shape of the response curve can significantly impact estimates of ecosystem carbon release and/or storage. Given this, it is critical to establish whether there are predictable patterns in the shape of the respiration–temperature response curve, and thus in the intrinsic temperature sensitivity of respiration across the globe. Analyzing measurements in a comprehensive database for 231 species spanning 7 biomes, we demonstrate that temperature-dependent increases in leaf respiration do not follow a commonly used exponential function. Instead, we find a decelerating function as leaves warm, reflecting a declining sensitivity to higher temperatures that is remarkably uniform across all biomes and plant functional types. Such convergence in the temperature sensitivity of leaf respiration suggests that there are universally applicable controls on the temperature response of plant energy metabolism, such that a single new function can predict the temperature dependence of leaf respiration for global vegetation. This simple function enables straightforward description of plant respiration in the land-surface components of coupled earth system models. Our cross-biome analyses show significant implications for such fluxes in cold climates, generally projecting lower values compared with previous estimates

    Erratum to: Methods for evaluating medical tests and biomarkers

    [This corrects the article DOI: 10.1186/s41512-016-0001-y.]

    Common, low-frequency, rare, and ultra-rare coding variants contribute to COVID-19 severity

    The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole-exome sequencing data of about 4000 SARS-CoV-2-positive individuals were used to define an interpretable machine-learning model for predicting COVID-19 severity. First, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthily, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management. © 2021, The Author(s)

    Sixteen diverse laboratory mouse reference genomes define strain-specific haplotypes and novel functional loci.

    We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene structures. Also, 62 new coding loci were added to the reference genome annotation. These genomes identified a large, previously unannotated, gene (Efcab3-like) encoding 5,874 amino acids. Mutant Efcab3-like mice display anomalies in multiple brain regions, suggesting a possible role for this gene in the regulation of brain development

    Response and inbreeding from a genomic selection experiment in layer chickens

    Genomic selection (GS) using estimated breeding values (GS-EBV) based on dense marker data is a promising approach for genetic improvement. A simulation study was undertaken to illustrate the opportunities offered by GS for designing breeding programs. It consisted of a selection program for a sex-limited trait in layer chickens, which was developed by deterministic predictions under different scenarios. Later, one of the possible schemes was implemented in a real population of layer chicken. Methods: In the simulation, the aim was to double the response to selection per year by reducing the generation interval by 50 %, while maintaining the same rate of inbreeding per year. We found that GS with retraining could achieve the set objectives while requiring 75 % fewer reared birds and 82 % fewer phenotyped birds per year. A multi-trait GS scenario was subsequently implemented in a real population of brown egg laying hens. The population was split into two sub-lines, one was submitted to conventional phenotypic selection, and one was selected based on genomic prediction. At the end of the 3-year experiment, the two sub-lines were compared for multiple performance traits that are relevant for commercial egg production. Results: Birds that were selected based on genomic prediction outperformed those that were submitted to conventional selection for most of the 16 traits that were included in the index used for selection. However, although the two programs were designed to achieve the same rate of inbreeding per year, the realized inbreeding per year assessed from pedigree was higher in the genomic selected line than in the conventionally selected line. Conclusions: The results demonstrate that GS is a promising alternative to conventional breeding for genetic improvement of layer chickens

    Comprehensive Rare Variant Analysis via Whole-Genome Sequencing to Determine the Molecular Pathology of Inherited Retinal Disease

    Inherited retinal disease is a common cause of visual impairment and represents a highly heterogeneous group of conditions. Here, we present findings from a cohort of 722 individuals with inherited retinal disease, who have had whole-genome sequencing (n = 605), whole-exome sequencing (n = 72), or both (n = 45) performed, as part of the NIHR-BioResource Rare Diseases research study. We identified pathogenic variants (single-nucleotide variants, indels, or structural variants) for 404/722 (56%) individuals. Whole-genome sequencing gives unprecedented power to detect three categories of pathogenic variants in particular: structural variants, variants in GC-rich regions, which have significantly improved coverage compared to whole-exome sequencing, and variants in non-coding regulatory regions. In addition to previously reported pathogenic regulatory variants, we have identified a previously unreported pathogenic intronic variant in CHM in two males with choroideremia. We have also identified 19 genes not previously known to be associated with inherited retinal disease, which harbor biallelic predicted protein-truncating variants in unsolved cases. Whole-genome sequencing is an increasingly important comprehensive method with which to investigate the genetic causes of inherited retinal disease. This work was supported by The National Institute for Health Research England (NIHR) for the NIHR BioResource – Rare Diseases project (grant number RG65966). The Moorfields Eye Hospital cohort of patients and clinical and imaging data were ascertained and collected with the support of grants from the National Institute for Health Research Biomedical Research Centre at Moorfields Eye Hospital, National Health Service Foundation Trust, and UCL Institute of Ophthalmology, Moorfields Eye Hospital Special Trustees, Moorfields Eye Charity, the Foundation Fighting Blindness (USA), and Retinitis Pigmentosa Fighting Blindness. M.M. is a recipient of an FFB Career Development Award. E.M. is supported by UCLH/UCL NIHR Biomedical Research Centre. F.L.R. and D.G. are supported by Cambridge NIHR Biomedical Research Centre

    The development and validation of a scoring tool to predict the operative duration of elective laparoscopic cholecystectomy

    Background: The ability to accurately predict operative duration has the potential to optimise theatre efficiency and utilisation, thus reducing costs and increasing staff and patient satisfaction. With laparoscopic cholecystectomy being one of the most commonly performed procedures worldwide, a tool to predict operative duration could be extremely beneficial to healthcare organisations. Methods: Data collected from the CholeS study on patients undergoing cholecystectomy in UK and Irish hospitals between 04/2014 and 05/2014 were used to study operative duration. A multivariable binary logistic regression model was produced in order to identify significant independent predictors of long (> 90 min) operations. The resulting model was converted to a risk score, which was subsequently validated on a second cohort of patients using ROC curves. Results: After exclusions, data were available for 7227 patients in the derivation (CholeS) cohort. The median operative duration was 60 min (interquartile range 45–85), with 17.7% of operations lasting longer than 90 min. Ten factors were found to be significant independent predictors of operative durations > 90 min, including ASA, age, previous surgical admissions, BMI, gallbladder wall thickness and CBD diameter. A risk score was then produced from these factors, and applied to a cohort of 2405 patients from a tertiary centre for external validation. This returned an area under the ROC curve of 0.708 (SE = 0.013, p < 0.001), with the proportion of operations lasting > 90 min increasing more than eightfold from 5.1 to 41.8% in the extremes of the score. Conclusion: The scoring tool produced in this study was found to be significantly predictive of long operative durations on validation in an external cohort. As such, the tool may have the potential to enable organisations to better organise theatre lists and deliver greater efficiencies in care

    Abiraterone for Prostate Cancer Not Previously Treated with Hormone Therapy

    BACKGROUND Abiraterone acetate plus prednisolone improves survival in men with relapsed prostate cancer. We assessed the effect of this combination in men starting long-term androgen-deprivation therapy (ADT), using a multigroup, multistage trial design. METHODS We randomly assigned patients in a 1:1 ratio to receive ADT alone or ADT plus abiraterone acetate (1000 mg daily) and prednisolone (5 mg daily) (combination therapy). Local radiotherapy was mandated for patients with node-negative, nonmetastatic disease and encouraged for those with positive nodes. For patients with nonmetastatic disease with no radiotherapy planned and for patients with metastatic disease, treatment continued until radiologic, clinical, or prostate-specific antigen (PSA) progression; otherwise, treatment was to continue for 2 years or until any type of progression, whichever came first. The primary outcome measure was overall survival. The intermediate primary outcome was failure-free survival (treatment failure was defined as radiologic, clinical, or PSA progression or death from prostate cancer). RESULTS A total of 1917 patients underwent randomization from November 2011 through January 2014. The median age was 67 years, and the median PSA level was 53 ng per milliliter. A total of 52% of the patients had metastatic disease, 20% had node-positive or node-indeterminate nonmetastatic disease, and 28% had node-negative, nonmetastatic disease; 95% had newly diagnosed disease. The median follow-up was 40 months. There were 184 deaths in the combination group as compared with 262 in the ADT-alone group (hazard ratio, 0.63; 95% confidence interval [CI], 0.52 to 0.76; P<0.001); the hazard ratio was 0.75 in patients with nonmetastatic disease and 0.61 in those with metastatic disease. There were 248 treatment-failure events in the combination group as compared with 535 in the ADT-alone group (hazard ratio, 0.29; 95% CI, 0.25 to 0.34; P<0.001); the hazard ratio was 0.21 in patients with nonmetastatic disease and 0.31 in those with metastatic disease. Grade 3 to 5 adverse events occurred in 47% of the patients in the combination group (with nine grade 5 events) and in 33% of the patients in the ADT-alone group (with three grade 5 events). CONCLUSIONS Among men with locally advanced or metastatic prostate cancer, ADT plus abiraterone and prednisolone was associated with significantly higher rates of overall and failure-free survival than ADT alone. (Funded by Cancer Research U.K. and others; STAMPEDE ClinicalTrials.gov number, NCT00268476, and Current Controlled Trials number, ISRCTN78818544.)