    Multi-modality machine learning predicting Parkinson's disease

    Personalized medicine promises individualized disease prediction and treatment. The convergence of machine learning (ML) and available multimodal data is key moving forward. We build upon previous work to deliver multimodal predictions of Parkinson's disease (PD) risk and systematically develop a model using GenoML, an automated ML package, to make improved multi-omic predictions of PD, validated in an external cohort. We investigated top features, constructed hypothesis-free disease-relevant networks, and investigated drug-gene interactions. We performed automated ML on multimodal data from the Parkinson's Progression Markers Initiative (PPMI). After selecting the best-performing algorithm, all PPMI data were used to tune the selected model. The model was validated in the Parkinson's Disease Biomarkers Program (PDBP) dataset. Our initial model showed an area under the curve (AUC) of 89.72% for the diagnosis of PD. The tuned model was then validated on external data (PDBP, AUC 85.03%). Optimizing thresholds for classification increased the diagnosis prediction accuracy and other metrics. Finally, networks were built to identify gene communities specific to PD. Combining data modalities outperforms the single biomarker paradigm. UPSIT and PRS contributed most to the predictive power of the model, but their accuracy is supplemented by many smaller-effect transcripts and risk SNPs. Our model is best suited to identifying large groups of individuals to monitor within a health registry or biobank to prioritize for further testing. This approach allows complex predictive models to be reproducible and accessible to the community, with the package, code, and results publicly available.
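
The abstract above reports AUC values and notes that optimizing the classification threshold improved accuracy, but it does not say which criterion was used. The following is a minimal sketch of the general idea, assuming Youden's J statistic (sensitivity + specificity - 1) as the selection rule. The function names and the toy scores are hypothetical illustrations; they are not taken from GenoML or from the PPMI/PDBP data.

```python
from typing import List, Tuple


def roc_auc(labels: List[int], scores: List[float]) -> float:
    """AUC as the probability that a random case outscores a random control.

    Ties between a case and a control count as half a win.
    """
    cases = [s for y, s in zip(labels, scores) if y == 1]
    controls = [s for y, s in zip(labels, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    wins = sum(
        1.0 if c > k else 0.5 if c == k else 0.0
        for c in cases
        for k in controls
    )
    return wins / (len(cases) * len(controls))


def accuracy(labels: List[int], scores: List[float], threshold: float) -> float:
    """Fraction classified correctly when 'case' means score >= threshold."""
    correct = sum(1 for y, s in zip(labels, scores) if (s >= threshold) == (y == 1))
    return correct / len(labels)


def best_threshold(labels: List[int], scores: List[float]) -> Tuple[float, float]:
    """Return (threshold, J) maximizing Youden's J over the observed scores.

    Assumption: Youden's J is one common criterion; the source does not
    state which criterion its authors optimized.
    """
    n_cases = sum(labels)
    n_controls = len(labels) - n_cases
    best_t, best_j = 0.5, -1.0
    for t in sorted(set(scores)):
        true_pos = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= t)
        true_neg = sum(1 for y, s in zip(labels, scores) if y == 0 and s < t)
        j = true_pos / n_cases + true_neg / n_controls - 1.0
        if j > best_j:  # keep the lowest threshold on ties
            best_t, best_j = t, j
    return best_t, best_j


# Synthetic example: 4 controls (0) and 4 cases (1) with made-up model scores.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.10, 0.20, 0.35, 0.60, 0.40, 0.55, 0.70, 0.90]

threshold, j_stat = best_threshold(labels, scores)
print("AUC:", roc_auc(labels, scores))
print("accuracy at default 0.5:", accuracy(labels, scores, 0.5))
print("chosen threshold:", threshold, "Youden's J:", j_stat)
print("accuracy at chosen threshold:", accuracy(labels, scores, threshold))
```

On this toy data the tuned threshold classifies more samples correctly than the default cutoff of 0.5. In practice the threshold would be chosen on training or tuning data only and then applied unchanged to the external validation cohort, so that the reported metrics are not inflated.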

    The IPDGC/GP2 Hackathon - an open science event for training in data science, genomics, and collaboration using Parkinson’s disease data

    Open science and collaboration are necessary to facilitate the advancement of Parkinson's disease (PD) research. Hackathons are collaborative events that bring together people with different skill sets and backgrounds to generate resources and creative solutions to problems. These events can be used as training and networking opportunities, so we coordinated a virtual 3-day hackathon event, during which 49 early-career scientists from 12 countries built tools and pipelines with a focus on PD. Resources were created with the goal of helping scientists accelerate their own research by having access to the necessary code and tools. Each team was allocated one of nine different projects, each with a different goal. These included developing post-genome-wide association studies (GWAS) analysis pipelines, downstream analysis of genetic variation pipelines, and various visualization tools. Hackathons are a valuable approach to inspire creative thinking, supplement training in data science, and foster collaborative scientific relationships, which are foundational practices for early-career researchers. The resources generated can be used to accelerate research on the genetics of PD.

    Defining the causes of sporadic Parkinson's disease in the global Parkinson's genetics program (GP2)

    The Global Parkinson’s Genetics Program (GP2) will genotype over 150,000 participants from around the world, and integrate genetic and clinical data for use in large-scale analyses to dramatically expand our understanding of the genetic architecture of Parkinson's disease (PD). This report details the workflow for cohort integration into the complex arm of GP2, and together with our outline of the monogenic hub in a companion paper, provides a generalizable blueprint for establishing large-scale collaborative research consortia.

    Genetics of Parkinson's disease: An introspection of its journey towards precision medicine.

    A substantial proportion of risk for Parkinson's disease (PD) is driven by genetics. Progress in understanding the genetic basis of PD has been significant. So far, highly penetrant rare genetic alterations in SNCA, LRRK2, VPS35, PRKN, PINK1, DJ-1 and GBA have been linked with typical familial PD, and common genetic variability at 90 loci has been linked to risk for PD. In this review, we outline the journey thus far of PD genetics, highlighting how significant advances have improved our knowledge of the genetic basis of PD risk, onset and progression. Despite remarkable progress, our field has yet to unravel how genetic risk variants disrupt biological pathways and molecular networks underlying the pathobiology of the disease. We highlight that currently identified genetic risk factors only represent a fraction of the likely genetic risk for PD. Identifying the remaining genetic risk will require us to diversify our efforts, performing genetic studies across different ancestral groups. This work will inform us on the varied genetic basis of disease across populations and also aid in fine mapping discovered loci. If we are able to take this course, we foresee that genetic discoveries in PD will directly influence our ability to predict disease and aid in defining etiological subtypes, critical steps for the implementation of precision medicine for PD.
