11 research outputs found

    Advancing data science in drug development through an innovative computational framework for data sharing and statistical analysis

    Get PDF
    Background Novartis and the University of Oxford’s Big Data Institute (BDI) have established a research alliance with the aim to improve health care and drug development by making it more efficient and targeted. Using a combination of the latest statistical machine learning technology with an innovative IT platform developed to manage large volumes of anonymised data from numerous data sources and types we plan to identify novel patterns with clinical relevance which cannot be detected by humans alone to identify phenotypes and early predictors of patient disease activity and progression. Method The collaboration focuses on highly complex autoimmune diseases and develops a computational framework to assemble a research-ready dataset across numerous modalities. For the Multiple Sclerosis (MS) project, the collaboration has anonymised and integrated phase II to phase IV clinical and imaging trial data from ≈35,000 patients across all clinical phenotypes and collected in more than 2200 centres worldwide. For the “IL-17” project, the collaboration has anonymised and integrated clinical and imaging data from over 30 phase II and III Cosentyx clinical trials including more than 15,000 patients, suffering from four autoimmune disorders (Psoriasis, Axial Spondyloarthritis, Psoriatic arthritis (PsA) and Rheumatoid arthritis (RA)). Results A fundamental component of successful data analysis and the collaborative development of novel machine learning methods on these rich data sets has been the construction of a research informatics framework that can capture the data at regular intervals where images could be anonymised and integrated with the de-identified clinical data, quality controlled and compiled into a research-ready relational database which would then be available to multi-disciplinary analysts. The collaborative development from a group of software developers, data wranglers, statisticians, clinicians, and domain scientists across both organisations has been key. This framework is innovative, as it facilitates collaborative data management and makes a complicated clinical trial data set from a pharmaceutical company available to academic researchers who become associated with the project. Conclusions An informatics framework has been developed to capture clinical trial data into a pipeline of anonymisation, quality control, data exploration, and subsequent integration into a database. Establishing this framework has been integral to the development of analytical tools

    Soft windowing application to improve analysis of high-throughput phenotyping data.

    Get PDF
    MOTIVATION: High-throughput phenomic projects generate complex data from small treatment and large control groups that increase the power of the analyses but introduce variation over time. A method is needed to utlize a set of temporally local controls that maximizes analytic power while minimizing noise from unspecified environmental factors. RESULTS: Here we introduce \u27soft windowing\u27, a methodological approach that selects a window of time that includes the most appropriate controls for analysis. Using phenotype data from the International Mouse Phenotyping Consortium (IMPC), adaptive windows were applied such that control data collected proximally to mutants were assigned the maximal weight, while data collected earlier or later had less weight. We applied this method to IMPC data and compared the results with those obtained from a standard non-windowed approach. Validation was performed using a resampling approach in which we demonstrate a 10% reduction of false positives from 2.5 million analyses. We applied the method to our production analysis pipeline that establishes genotype-phenotype associations by comparing mutant versus control data. We report an increase of 30% in significant P-values, as well as linkage to 106 versus 99 disease models via phenotype overlap with the soft-windowed and non-windowed approaches, respectively, from a set of 2082 mutant mouse lines. Our method is generalizable and can benefit large-scale human phenomic projects such as the UK Biobank and the All of Us resources. AVAILABILITY AND IMPLEMENTATION: The method is freely available in the R package SmoothWin, available on CRAN http://CRAN.R-project.org/package=SmoothWin. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online

    Identification of genes required for eye development by high-throughput screening of mouse knockouts.

    Get PDF
    Despite advances in next generation sequencing technologies, determining the genetic basis of ocular disease remains a major challenge due to the limited access and prohibitive cost of human forward genetics. Thus, less than 4,000 genes currently have available phenotype information for any organ system. Here we report the ophthalmic findings from the International Mouse Phenotyping Consortium, a large-scale functional genetic screen with the goal of generating and phenotyping a null mutant for every mouse gene. Of 4364 genes evaluated, 347 were identified to influence ocular phenotypes, 75% of which are entirely novel in ocular pathology. This discovery greatly increases the current number of genes known to contribute to ophthalmic disease, and it is likely that many of the genes will subsequently prove to be important in human ocular development and disease

    Aromatase inhibitors versus tamoxifen in premenopausal women with oestrogen receptor-positive early-stage breast cancer treated with ovarian suppression: a patient-level meta-analysis of 7030 women from four randomised trials

    Get PDF

    Radiotherapy to regional nodes in early breast cancer: an individual patient data meta-analysis of 14324 women in 16 trials

    No full text
    BackgroundRadiotherapy has become much better targeted since the 1980s, improving both safety and efficacy. In breast cancer, radiotherapy to regional lymph nodes aims to reduce risks of recurrence and death. Its effects have been studied in randomised trials, some before the 1980s and some after. We aimed to assess the effects of regional node radiotherapy in these two eras.MethodsIn this meta-analysis of individual patient data, we sought data from all randomised trials of regional lymph node radiotherapy versus no regional lymph node radiotherapy in women with early breast cancer (including one study that irradiated lymph nodes only if the cancer was right-sided). Trials were identified through the EBCTCG's regular systematic searches of databases including MEDLINE, Embase, the Cochrane Library, and meeting abstracts. Trials were eligible if they began before Jan 1, 2009. The only systematic difference between treatment groups was in regional node radiotherapy (to the internal mammary chain, supraclavicular fossa, or axilla, or any combinations of these). Primary outcomes were recurrence at any site, breast cancer mortality, non-breast-cancer mortality, and all-cause mortality. Data were supplied by trialists and standardised into a format suitable for analysis. A summary of the formatted data was returned to trialists for verification. Log-rank analyses yielded first-event rate ratios (RRs) and confidence intervals.FindingsWe found 17 eligible trials, 16 of which had available data (for 14 324 participants), and one of which (henceforth excluded), had unavailable data (for 165 participants). In the eight newer trials (12 167 patients), which started during 1989–2008, regional node radiotherapy significantly reduced recurrence (rate ratio 0·88, 95% CI 0·81–0·95; p=0·0008). The main effect was on distant recurrence as few regional node recurrences were reported. Radiotherapy significantly reduced breast cancer mortality (RR 0·87, 95% CI 0·80–0·94; p=0·0010), with no significant effect on non-breast-cancer mortality (0·97, 0·84–1·11; p=0·63), leading to significantly reduced all-cause mortality (0·90, 0·84–0·96; p=0·0022). In an illustrative calculation, estimated absolute reductions in 15-year breast cancer mortality were 1·6% for women with no positive axillary nodes, 2·7% for those with one to three positive axillary nodes, and 4·5% for those with four or more positive axillary nodes. In the eight older trials (2157 patients), which started during 1961–78, regional node radiotherapy had little effect on breast cancer mortality (RR 1·04, 95% CI 0·91–1·20; p=0·55), but significantly increased non-breast-cancer mortality (1·42, 1·18–1·71; p=0·00023), with risk mainly after year 20, and all-cause mortality (1·17, 1·04–1·31; p=0·0067).InterpretationRegional node radiotherapy significantly reduced breast cancer mortality and all-cause mortality in trials done after the 1980s, but not in older trials. These contrasting findings could reflect radiotherapy improvements since the 1980s.<br/
    corecore