5 research outputs found

    REQUITE: A prospective multicentre cohort study of patients undergoing radiotherapy for breast, lung or prostate cancer

    Get PDF
    Purpose: REQUITE aimed to establish a resource for multi-national validation of models and biomarkers that predict risk of late toxicity following radiotherapy. The purpose of this article is to provide summary descriptive data. Methods: An international, prospective cohort study recruited cancer patients in 26 hospitals in eight countries between April 2014 and March 2017. Target recruitment was 5300 patients. Eligible patients had breast, prostate or lung cancer and planned potentially curable radiotherapy. Radiotherapy was prescribed according to local regimens, but centres used standardised data collection forms. Pre-treatment blood samples were collected. Patients were followed for a minimum of 12 (lung) or 24 (breast/prostate) months and summary descriptive statistics were generated. Results: The study recruited 2069 breast (99% of target), 1808 prostate (86%) and 561 lung (51%) cancer patients. The centralised, accessible database includes: physician-(47,025 forms) and patient-(54,901) reported outcomes; 11,563 breast photos; 17,107 DICOMs and 12,684 DVHs. Imputed genotype data are available for 4223 patients with European ancestry (1948 breast, 1728 prostate, 547 lung). Radiation-induced lymphocyte apoptosis (RILA) assay data are available for 1319 patients. DNA (n = 4409) and PAXgene tubes (n = 3039) are stored in the centralised biobank. Example prevalences of 2-year (1-year for lung) grade >= 2 CTCAE toxicities are 13% atrophy (breast), 3% rectal bleeding (prostate) and 27% dyspnoea (lung). Conclusion: The comprehensive centralised database and linked biobank is a valuable resource for the radiotherapy community for validating predictive models and biomarkers. Patient summary: Up to half of cancer patients undergo radiation therapy and irradiation of surrounding healthy tissue is unavoidable. Damage to healthy tissue can affect short-and long-term quality-of-life. Not all patients are equally sensitive to radiation "damage" but it is not possible at the moment to identify those who are. REQUITE was established with the aim of trying to understand more about how we could predict radiation sensitivity. The purpose of this paper is to provide an overview and summary of the data and material available. In the REQUITE study 4400 breast, prostate and lung cancer patients filled out questionnaires and donated blood. A large amount of data was collected in the same way. With all these data and samples a database and biobank were created that showed it is possible to collect this kind of information in a standardised way across countries. In the future, our database and linked biobank will be a resource for research and validation of clinical predictors and models of radiation sensitivity. REQUITE will also enable a better understanding of how many people suffer with radiotherapy toxicity

    USPTO Dataset for: Fast Chemical Reaction Condition Suggestion via Rule-Based Classification and Similarity Search

    No full text
    <p>USPTO database that is analyzed with Rxn-INSIGHT (<a href="https://github.com/mrodobbe/Rxn-INSIGHT">https://github.com/mrodobbe/Rxn-INSIGHT</a>).</p><p>This gzip file contains a very large Pandas DataFrame that can be loaded via pd.read_parquet('uspto_rxn_insight.gzip'). Because of the large size of the data, PyArrow version 13.0 must be used. </p><p>To use parquet in Pandas, install PyArrow and fastparquet using pip:</p><p>pip install pyarrow==13.0<br>pip install fastparquet</p><p>Fast Chemical Reaction Condition Suggestion via Rule-Based Classification and Similarity Search</p><p>The challenge of devising pathways for organic synthesis remains a central issue in the field of medicinal chemistry. Over the span of six decades, computer-aided synthesis planning has given rise to a plethora of potent tools for formulating synthetic routes. Nevertheless, a significant expert task still looms: determining the appropriate solvent, catalyst, and reagents when provided with a set of reactants to achieve and optimize the desired product for a specific step in the synthesis process. Typically, chemists identify key functional groups and rings that exert crucial influences at the reaction center, classify reactions into categories, and may assign them names. This research introduces Rxn-INSIGHT, an open-source algorithm based on the bond-electron matrix approach, with the purpose of automating this endeavor. Rxn-INSIGHT not only streamlines the process but also facilitates extensive querying of reaction databases, effectively replicating the thought processes of an organic chemist. The core functions of the algorithm encompass the classification and naming of reactions, extraction of functional groups, rings, and scaffolds from the involved chemical entities, and the provision of reaction condition recommendations based on the similarity and prevalence of reactions. The performance of our rule-based model has been rigorously assessed against a carefully curated benchmark dataset, exhibiting an accuracy rate exceeding 90% in reaction classification and surpassing 95% in reaction naming. Notably, it has been discerned that a pivotal factor in selecting analogous reactions lies in the analysis of ring structures participating in the reactions. An examination of ring structures within the USPTO chemical reaction database reveals that with just 35 unique rings, a remarkable 75% of all rings found in nearly 1 million products can be encompassed. Furthermore, Rxn-INSIGHT is proficient in suggesting appropriate choices for solvents, catalysts, and reagents in entirely novel reactions, all within the span of a second, utilizing nothing more than an everyday laptop.</p&gt

    Active Machine Learning for Chemical Engineers: a Bright Future Lies Ahead!

    No full text
    By combining machine learning with design of experiments, so-called active machine learning, more efficient and cheaper research can be conducted. Machine learning algorithms are more flexible, and are better at investigating the processes spanning all length scales of chemical engineering. While the active machine learning algorithms are maturing, its applications are lacking behind. Three types of challenges faced by active machine learning are identified and ways to overcome them are discussed: the convincing of the experimental researcher, the flexibility of data creation, and the robustness of the active machine learning algorithms. A bright future lies ahead for active machine learning in chemical engineering thanks to increasing automation and more efficient algorithms to drive novel discoveries

    Active Machine Learning for Chemical Engineers: A Bright Future Lies Ahead!

    No full text
    By combining machine learning with the design of experiments, thereby achieving so-called active machine learning, more efficient and cheaper research can be conducted. Machine learning algorithms are more flexible and are better than traditional design of experiment algorithms at investigating processes spanning all length scales of chemical engineering. While active machine learning algorithms are maturing, their applications are falling behind. In this article, three types of challenges presented by active machine learning—namely, convincing the experimental researcher, the flexibility of data creation, and the robustness of active machine learning algorithms—are identified, and ways to overcome them are discussed. A bright future lies ahead for active machine learning in chemical engineering, thanks to increasing automation and more efficient algorithms that can drive novel discoveries

    REQUITE: A prospective multicentre cohort study of patients undergoing radiotherapy for breast, lung or prostate cancer

    No full text
    PURPOSE: REQUITE aimed to establish a resource for multi-national validation of models and biomarkers that predict risk of late toxicity following radiotherapy. The purpose of this article is to provide summary descriptive data. METHODS: An international, prospective cohort study recruited cancer patients in 26 hospitals in eight countries between April 2014 and March 2017. Target recruitment was 5300 patients. Eligible patients had breast, prostate or lung cancer and planned potentially curable radiotherapy. Radiotherapy was prescribed according to local regimens, but centres used standardised data collection forms. Pre-treatment blood samples were collected. Patients were followed for a minimum of 12 (lung) or 24 (breast/prostate) months and summary descriptive statistics were generated. RESULTS: The study recruited 2069 breast (99% of target), 1808 prostate (86%) and 561 lung (51%) cancer patients. The centralised, accessible database includes: physician- (47,025 forms) and patient- (54,901) reported outcomes; 11,563 breast photos; 17,107 DICOMs and 12,684 DVHs. Imputed genotype data are available for 4223 patients with European ancestry (1948 breast, 1728 prostate, 547 lung). Radiation-induced lymphocyte apoptosis (RILA) assay data are available for 1319 patients. DNA (n = 4409) and PAXgene tubes (n = 3039) are stored in the centralised biobank. Example prevalences of 2-year (1-year for lung) grade ≥2 CTCAE toxicities are 13% atrophy (breast), 3% rectal bleeding (prostate) and 27% dyspnoea (lung). CONCLUSION: The comprehensive centralised database and linked biobank is a valuable resource for the radiotherapy community for validating predictive models and biomarkers. PATIENT SUMMARY: Up to half of cancer patients undergo radiation therapy and irradiation of surrounding healthy tissue is unavoidable. Damage to healthy tissue can affect short- and long-term quality-of-life. Not all patients are equally sensitive to radiation "damage" but it is not possible at the moment to identify those who are. REQUITE was established with the aim of trying to understand more about how we could predict radiation sensitivity. The purpose of this paper is to provide an overview and summary of the data and material available. In the REQUITE study 4400 breast, prostate and lung cancer patients filled out questionnaires and donated blood. A large amount of data was collected in the same way. With all these data and samples a database and biobank were created that showed it is possible to collect this kind of information in a standardised way across countries. In the future, our database and linked biobank will be a resource for research and validation of clinical predictors and models of radiation sensitivity. REQUITE will also enable a better understanding of how many people suffer with radiotherapy toxicity.status: publishe
    corecore