191 research outputs found

    VLSP Shared Task: Named Entity Recognition

    Get PDF
    Named entities (NE) are phrases that contain the names of persons, organizations, locations, times and quantities, monetary values, percentages, etc. Named Entity Recognition (NER) is the task of recognizing named entities in documents. NER is an important subtask of Information Extraction, which has attracted researchers all over the world since 1990s. For Vietnamese language, although there exists some research projects and publications on NER task before 2016, no systematic comparison of the performance of NER systems has been done. In 2016, the organizing committee of the VLSP workshop decided to launch the first NER shared task, in order to get an objective evaluation of Vietnamese NER systems and to promote the development of high quality systems. As a result, the first dataset with morpho-syntactic and NE annotations has been released for benchmarking NER systems. At VLSP 2018, the NER shared task has been organized for the second time, providing a bigger dataset containing texts from various domains, but without morpho-syntactic annotation. These resources are available for research purpose via the VLSP website vlsp.org.vn/resources. In this paper, we describe the datasets as well as the evaluation results obtained from these two campaigns

    VLSP SHARED TASK: SENTIMENT ANALYSIS

    Get PDF
    Sentiment analysis is a natural language processing (NLP) task of identifying orextracting the sentiment content of a text unit. This task has become an active research topic since the early 2000s. During the two last editions of the VLSP workshop series, the shared task on Sentiment Analysis (SA) for Vietnamese has been organized in order to provide an objective evaluation measurement about the performance (quality) of sentiment analysis tools, and encouragethe development of Vietnamese sentiment analysis systems, as well as to provide benchmark datasets for this task. The rst campaign in 2016 only focused on the sentiment polarity classication, with a dataset containing reviews of electronic products. The second campaign in 2018 addressed the problem of Aspect Based Sentiment Analysis (ABSA) for Vietnamese, by providing two datasets containing reviews in restaurant and hotel domains. These data are accessible for research purpose via the VLSP website vlsp.org.vn/resources. This paper describes the built datasets as well as the evaluation results of the systems participating to these campaigns

    Epidemiology of facial fractures: Incidence, prevalence and years lived with disability estimates from the Global Burden of Disease 2017 study

    Get PDF
    Background: The Global Burden of Disease Study (GBD) has historically produced estimates of causes of injury such as falls but not the resulting types of injuries that occur. The objective of this study was to estimate the global incidence, prevalence and years lived with disability (YLDs) due to facial fractures and to estimate the leading injurious causes of facial fracture. Methods: We obtained results from GBD 2017. First, the study estimated the incidence from each injury cause (eg, falls), and then the proportion of each cause that would result in facial fracture being the most disabling injury. Incidence, prevalence and YLDs of facial fractures are then calculated across causes. Results: Globally, in 2017, there were 7 538 663 (95% uncertainty interval 6 116 489 to 9 4

    Global, regional, and national burden of chronic kidney disease, 1990–2017 : a systematic analysis for the Global Burden of Disease Study 2017

    Get PDF
    Background Health system planning requires careful assessment of chronic kidney disease (CKD) epidemiology, but data for morbidity and mortality of this disease are scarce or non-existent in many countries. We estimated the global, regional, and national burden of CKD, as well as the burden of cardiovascular disease and gout attributable to impaired kidney function, for the Global Burden of Diseases, Injuries, and Risk Factors Study 2017. We use the term CKD to refer to the morbidity and mortality that can be directly attributed to all stages of CKD, and we use the term impaired kidney function to refer to the additional risk of CKD from cardiovascular disease and gout. Methods The main data sources we used were published literature, vital registration systems, end-stage kidney disease registries, and household surveys. Estimates of CKD burden were produced using a Cause of Death Ensemble model and a Bayesian meta-regression analytical tool, and included incidence, prevalence, years lived with disability, mortality, years of life lost, and disability-adjusted life-years (DALYs). A comparative risk assessment approach was used to estimate the proportion of cardiovascular diseases and gout burden attributable to impaired kidney function. Findings Globally, in 2017, 1·2 million (95% uncertainty interval [UI] 1·2 to 1·3) people died from CKD. The global all-age mortality rate from CKD increased 41·5% (95% UI 35·2 to 46·5) between 1990 and 2017, although there was no significant change in the age-standardised mortality rate (2·8%, −1·5 to 6·3). In 2017, 697·5 million (95% UI 649·2 to 752·0) cases of all-stage CKD were recorded, for a global prevalence of 9·1% (8·5 to 9·8). The global all-age prevalence of CKD increased 29·3% (95% UI 26·4 to 32·6) since 1990, whereas the age-standardised prevalence remained stable (1·2%, −1·1 to 3·5). CKD resulted in 35·8 million (95% UI 33·7 to 38·0) DALYs in 2017, with diabetic nephropathy accounting for almost a third of DALYs. Most of the burden of CKD was concentrated in the three lowest quintiles of Socio-demographic Index (SDI). In several regions, particularly Oceania, sub-Saharan Africa, and Latin America, the burden of CKD was much higher than expected for the level of development, whereas the disease burden in western, eastern, and central sub-Saharan Africa, east Asia, south Asia, central and eastern Europe, Australasia, and western Europe was lower than expected. 1·4 million (95% UI 1·2 to 1·6) cardiovascular disease-related deaths and 25·3 million (22·2 to 28·9) cardiovascular disease DALYs were attributable to impaired kidney function. Interpretation Kidney disease has a major effect on global health, both as a direct cause of global morbidity and mortality and as an important risk factor for cardiovascular disease. CKD is largely preventable and treatable and deserves greater attention in global health policy decision making, particularly in locations with low and middle SDI

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts

    Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images

    Get PDF
    Beyond sample curation and basic pathologic characterization, the digitized H&E-stained images of TCGA samples remain underutilized. To highlight this resource, we present mappings of tumorinfiltrating lymphocytes (TILs) based on H&E images from 13 TCGA tumor types. These TIL maps are derived through computational staining using a convolutional neural network trained to classify patches of images. Affinity propagation revealed local spatial structure in TIL patterns and correlation with overall survival. TIL map structural patterns were grouped using standard histopathological parameters. These patterns are enriched in particular T cell subpopulations derived from molecular measures. TIL densities and spatial structure were differentially enriched among tumor types, immune subtypes, and tumor molecular subtypes, implying that spatial infiltrate state could reflect particular tumor cell aberration states. Obtaining spatial lymphocytic patterns linked to the rich genomic characterization of TCGA samples demonstrates one use for the TCGA image archives with insights into the tumor-immune microenvironment

    Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

    Get PDF
    This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

    Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

    Get PDF
    Although theMYConcogene has been implicated incancer, a systematic assessment of alterations ofMYC, related transcription factors, and co-regulatoryproteins, forming the proximal MYC network (PMN),across human cancers is lacking. Using computa-tional approaches, we define genomic and proteo-mic features associated with MYC and the PMNacross the 33 cancers of The Cancer Genome Atlas.Pan-cancer, 28% of all samples had at least one ofthe MYC paralogs amplified. In contrast, the MYCantagonists MGA and MNT were the most frequentlymutated or deleted members, proposing a roleas tumor suppressors.MYCalterations were mutu-ally exclusive withPIK3CA,PTEN,APC,orBRAFalterations, suggesting that MYC is a distinct onco-genic driver. Expression analysis revealed MYC-associated pathways in tumor subtypes, such asimmune response and growth factor signaling; chro-matin, translation, and DNA replication/repair wereconserved pan-cancer. This analysis reveals insightsinto MYC biology and is a reference for biomarkersand therapeutics for cancers with alterations ofMYC or the PMN

    Global, regional, and national burden of tuberculosis, 1990–2016: results from the Global Burden of Diseases, Injuries, and Risk Factors 2016 Study

    Get PDF
    Background Although a preventable and treatable disease, tuberculosis causes more than a million deaths each year. As countries work towards achieving the Sustainable Development Goal (SDG) target to end the tuberculosis epidemic by 2030, robust assessments of the levels and trends of the burden of tuberculosis are crucial to inform policy and programme decision making. We assessed the levels and trends in the fatal and non-fatal burden of tuberculosis by drug resistance and HIV status for 195 countries and territories from 1990 to 2016. Methods We analysed 15 943 site-years of vital registration data, 1710 site-years of verbal autopsy data, 764 site-years of sample-based vital registration data, and 361 site-years of mortality surveillance data to estimate mortality due to tuberculosis using the Cause of Death Ensemble model. We analysed all available data sources, including annual case notifications, prevalence surveys, population-based tuberculin surveys, and estimated tuberculosis cause-specific mortality to generate internally consistent estimates of incidence, prevalence, and mortality using DisMod-MR 2.1, a Bayesian meta-regression tool. We assessed how the burden of tuberculosis differed from the burden predicted by the Socio-demographic Index (SDI), a composite indicator of income per capita, average years of schooling, and total fertility rate. Findings Globally in 2016, among HIV-negative individuals, the number of incident cases of tuberculosis was 9·02 million (95% uncertainty interval [UI] 8·05–10·16) and the number of tuberculosis deaths was 1·21 million (1·16–1·27). Among HIV-positive individuals, the number of incident cases was 1·40 million (1·01–1·89) and the number of tuberculosis deaths was 0·24 million (0·16–0·31). Globally, among HIV-negative individuals the age-standardised incidence of tuberculosis decreased annually at a slower rate (–1·3% [–1·5 to −1·2]) than mortality did (–4·5% [–5·0 to −4·1]) from 2006 to 2016. Among HIV-positive individuals during the same period, the rate of change in annualised age-standardised incidence was −4·0% (–4·5 to −3·7) and mortality was −8·9% (–9·5 to −8·4). Several regions had higher rates of age-standardised incidence and mortality than expected on the basis of their SDI levels in 2016. For drug-susceptible tuberculosis, the highest observed-to-expected ratios were in southern sub-Saharan Africa (13·7 for incidence and 14·9 for mortality), and the lowest ratios were in high-income North America (0·4 for incidence) and Oceania (0·3 for mortality). For multidrug-resistant tuberculosis, eastern Europe had the highest observed-to-expected ratios (67·3 for incidence and 73·0 for mortality), and high-income North America had the lowest ratios (0·4 for incidence and 0·5 for mortality). Interpretation If current trends in tuberculosis incidence continue, few countries are likely to meet the SDG target to end the tuberculosis epidemic by 2030. Progress needs to be accelerated by improving the quality of and access to tuberculosis diagnosis and care, by developing new tools, scaling up interventions to prevent risk factors for tuberculosis, and integrating control programmes for tuberculosis and HIV

    Epidemiology of injuries from fire, heat and hot substances : global, regional and national morbidity and mortality estimates from the Global Burden of Disease 2017 study

    Get PDF
    Background Past research has shown how fires, heat and hot substances are important causes of health loss globally. Detailed estimates of the morbidity and mortality from these injuries could help drive preventative measures and improved access to care. Methods We used the Global Burden of Disease 2017 framework to produce three main results. First, we produced results on incidence, prevalence, years lived with disability, deaths, years of life lost and disability-adjusted life years from 1990 to 2017 for 195 countries and territories. Second, we analysed these results to measure mortality-to-incidence ratios by location. Third, we reported the measures above in terms of the cause of fire, heat and hot substances and the types of bodily injuries that result. Results Globally, there were 8 991 468 (7 481 218 to 10 740 897) new fire, heat and hot substance injuries in 2017 with 120 632 (101 630 to 129 383) deaths. At the global level, the age-standardised mortality caused by fire, heat and hot substances significantly declined from 1990 to 2017, but regionally there was variability in age-standardised incidence with some regions experiencing an increase (eg, Southern Latin America) and others experiencing a significant decrease (eg, High-income North America). Conclusions The incidence and mortality of injuries that result from fire, heat and hot substances affect every region of the world but are most concentrated in middle and lower income areas. More resources should be invested in measuring these injuries as well as in improving infrastructure, advancing safety measures and ensuring access to care.Peer reviewe
    • …
    corecore