31 research outputs found

    Perspectives on tracking data reuse across biodata resources

    Get PDF
    c The Author(s) 2024. Published by Oxford University Press.Motivation: Data reuse is a common and vital practice in molecular biology and enables the knowledge gathered over recent decades to drive discovery and innovation in the life sciences. Much of this knowledge has been collated into molecular biology databases, such as UniProtKB, and these resources derive enormous value from sharing data among themselves. However, quantifying and documenting this kind of data reuse remains a challenge. Results: The article reports on a one-day virtual workshop hosted by the UniProt Consortium in March 2023, attended by representatives from biodata resources, experts in data management, and NIH program managers. Workshop discussions focused on strategies for tracking data reuse, best practices for reusing data, and the challenges associated with data reuse and tracking. Surveys and discussions showed that data reuse is widespread, but critical information for reproducibility is sometimes lacking. Challenges include costs of tracking data reuse, tensions between tracking data and open sharing, restrictive licenses, and difficulties in tracking commercial data use. Recommendations that emerged from the discussion include: development of standardized formats for documenting data reuse, education about the obstacles posed by restrictive licenses, and continued recognition by funding agencies that data management is a critical activity that requires dedicated resources

    The Gene Ontology knowledgebase in 2023

    Get PDF
    The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project

    The Gene Ontology knowledgebase in 2023

    Get PDF
    The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project

    The Gene Ontology resource: enriching a GOld mine

    Get PDF
    The Gene Ontology Consortium (GOC) provides the most comprehensive resource currently available for computable knowledge regarding the functions of genes and gene products. Here, we report the advances of the consortium over the past two years. The new GO-CAM annotation framework was notably improved, and we formalized the model with a computational schema to check and validate the rapidly increasing repository of 2838 GO-CAMs. In addition, we describe the impacts of several collaborations to refine GO and report a 10% increase in the number of GO annotations, a 25% increase in annotated gene products, and over 9,400 new scientific articles annotated. As the project matures, we continue our efforts to review older annotations in light of newer findings, and, to maintain consistency with other ontologies. As a result, 20 000 annotations derived from experimental data were reviewed, corresponding to 2.5% of experimental GO annotations. The website (http://geneontology.org) was redesigned for quick access to documentation, downloads and tools. To maintain an accurate resource and support traceability and reproducibility, we have made available a historical archive covering the past 15 years of GO data with a consistent format and file structure for both the ontology and annotations

    Lancet

    Get PDF
    BACKGROUND: In 2015, the second cycle of the CONCORD programme established global surveillance of cancer survival as a metric of the effectiveness of health systems and to inform global policy on cancer control. CONCORD-3 updates the worldwide surveillance of cancer survival to 2014. METHODS: CONCORD-3 includes individual records for 37.5 million patients diagnosed with cancer during the 15-year period 2000-14. Data were provided by 322 population-based cancer registries in 71 countries and territories, 47 of which provided data with 100% population coverage. The study includes 18 cancers or groups of cancers: oesophagus, stomach, colon, rectum, liver, pancreas, lung, breast (women), cervix, ovary, prostate, and melanoma of the skin in adults, and brain tumours, leukaemias, and lymphomas in both adults and children. Standardised quality control procedures were applied; errors were rectified by the registry concerned. We estimated 5-year net survival. Estimates were age-standardised with the International Cancer Survival Standard weights. FINDINGS: For most cancers, 5-year net survival remains among the highest in the world in the USA and Canada, in Australia and New Zealand, and in Finland, Iceland, Norway, and Sweden. For many cancers, Denmark is closing the survival gap with the other Nordic countries. Survival trends are generally increasing, even for some of the more lethal cancers: in some countries, survival has increased by up to 5% for cancers of the liver, pancreas, and lung. For women diagnosed during 2010-14, 5-year survival for breast cancer is now 89.5% in Australia and 90.2% in the USA, but international differences remain very wide, with levels as low as 66.1% in India. For gastrointestinal cancers, the highest levels of 5-year survival are seen in southeast Asia: in South Korea for cancers of the stomach (68.9%), colon (71.8%), and rectum (71.1%); in Japan for oesophageal cancer (36.0%); and in Taiwan for liver cancer (27.9%). By contrast, in the same world region, survival is generally lower than elsewhere for melanoma of the skin (59.9% in South Korea, 52.1% in Taiwan, and 49.6% in China), and for both lymphoid malignancies (52.5%, 50.5%, and 38.3%) and myeloid malignancies (45.9%, 33.4%, and 24.8%). For children diagnosed during 2010-14, 5-year survival for acute lymphoblastic leukaemia ranged from 49.8% in Ecuador to 95.2% in Finland. 5-year survival from brain tumours in children is higher than for adults but the global range is very wide (from 28.9% in Brazil to nearly 80% in Sweden and Denmark). INTERPRETATION: The CONCORD programme enables timely comparisons of the overall effectiveness of health systems in providing care for 18 cancers that collectively represent 75% of all cancers diagnosed worldwide every year. It contributes to the evidence base for global policy on cancer control. Since 2017, the Organisation for Economic Co-operation and Development has used findings from the CONCORD programme as the official benchmark of cancer survival, among their indicators of the quality of health care in 48 countries worldwide. Governments must recognise population-based cancer registries as key policy tools that can be used to evaluate both the impact of cancer prevention strategies and the effectiveness of health systems for all patients diagnosed with cancer. FUNDING: American Cancer Society; Centers for Disease Control and Prevention; Swiss Re; Swiss Cancer Research foundation; Swiss Cancer League; Institut National du Cancer; La Ligue Contre le Cancer; Rossy Family Foundation; US National Cancer Institute; and the Susan G Komen Foundation

    Editing the genome of hiPSC with CRISPR/Cas9: disease models

    Get PDF

    Worldwide trends in population-based survival for children, adolescents, and young adults diagnosed with leukaemia, by subtype, during 2000–14 (CONCORD-3) : analysis of individual data from 258 cancer registries in 61 countries

    Get PDF
    Background Leukaemias comprise a heterogenous group of haematological malignancies. In CONCORD-3, we analysed data for children (aged 0–14 years) and adults (aged 15–99 years) diagnosed with a haematological malignancy during 2000–14 in 61 countries. Here, we aimed to examine worldwide trends in survival from leukaemia, by age and morphology, in young patients (aged 0–24 years). Methods We analysed data from 258 population-based cancer registries in 61 countries participating in CONCORD-3 that submitted data on patients diagnosed with leukaemia. We grouped patients by age as children (0–14 years), adolescents (15–19 years), and young adults (20–24 years). We categorised leukaemia subtypes according to the International Classification of Childhood Cancer (ICCC-3), updated with International Classification of Diseases for Oncology, third edition (ICD-O-3) codes. We estimated 5-year net survival by age and morphology, with 95% CIs, using the non-parametric Pohar-Perme estimator. To control for background mortality, we used life tables by country or region, single year of age, single calendar year and sex, and, where possible, by race or ethnicity. All-age survival estimates were standardised to the marginal distribution of young people with leukaemia included in the analysis. Findings 164563 young people were included in this analysis: 121328 (73·7%) children, 22963 (14·0%) adolescents, and 20272 (12·3%) young adults. In 2010–14, the most common subtypes were lymphoid leukaemia (28205 [68·2%] patients) and acute myeloid leukaemia (7863 [19·0%] patients). Age-standardised 5-year net survival in children, adolescents, and young adults for all leukaemias combined during 2010–14 varied widely, ranging from 46% in Mexico to more than 85% in Canada, Cyprus, Belgium, Denmark, Finland, and Australia. Individuals with lymphoid leukaemia had better age-standardised survival (from 43% in Ecuador to ≥80% in parts of Europe, North America, Oceania, and Asia) than those with acute myeloid leukaemia (from 32% in Peru to ≥70% in most high-income countries in Europe, North America, and Oceania). Throughout 2000–14, survival from all leukaemias combined remained consistently higher for children than adolescents and young adults, and minimal improvement was seen for adolescents and young adults in most countries. Interpretation This study offers the first worldwide picture of population-based survival from leukaemia in children, adolescents, and young adults. Adolescents and young adults diagnosed with leukaemia continue to have lower survival than children. Trends in survival from leukaemia for adolescents and young adults are important indicators of the quality of cancer management in this age group.peer-reviewe

    Global survival trends for brain tumors, by histology: analysis of individual records for 556,237 adults diagnosed in 59 countries during 2000–2014 (CONCORD-3)

    Get PDF
    Background: Survival is a key metric of the effectiveness of a health system in managing cancer. We set out to provide a comprehensive examination of worldwide variation and trends in survival from brain tumors in adults, by histology. Methods: We analyzed individual data for adults (15–99 years) diagnosed with a brain tumor (ICD-O-3 topography code C71) during 2000–2014, regardless of tumor behavior. Data underwent a 3-phase quality control as part of CONCORD-3. We estimated net survival for 11 histology groups, using the unbiased nonparametric Pohar Perme estimator. Results: The study included 556,237 adults. In 2010–2014, the global range in age-standardized 5-year net survival for the most common sub-types was broad: in the range 20%–38% for diffuse and anaplastic astrocytoma, from 4% to 17% for glioblastoma, and between 32% and 69% for oligodendroglioma. For patients with glioblastoma, the largest gains in survival occurred between 2000–2004 and 2005–2009. These improvements were more noticeable among adults diagnosed aged 40–70 years than among younger adults. Conclusions: To the best of our knowledge, this study provides the largest account to date of global trends in population-based survival for brain tumors by histology in adults. We have highlighted remarkable gains in 5-year survival from glioblastoma since 2005, providing large-scale empirical evidence on the uptake of chemoradiation at population level. Worldwide, survival improvements have been extensive, but some countries still lag behind. Our findings may help clinicians involved in national and international tumor pathway boards to promote initiatives aimed at more extensive implementation of clinical guidelines
    corecore