8 research outputs found

    SLISEMAP: Combining Supervised Dimensionality Reduction with Local Explanations

    Get PDF
    We introduce a Python library, called slisemap, that contains a supervised dimensionality reduction method that can be used for global explanation of black box regression or classification models. slisemap takes a data matrix and predictions from a black box model as input, and outputs a (typically) two-dimensional embedding, such that the black box model can be approximated, to a good fidelity, by the same interpretable white box model for points with similar embeddings. The library includes basic visualisation tools and extensive documentation, making it easy to get started and obtain useful insights. The slisemap library is published on GitHub and PyPI under an open source license.Peer reviewe

    Technical note: Incorporating expert domain knowledge into causal structure discovery workflows

    Get PDF
    In this note, we argue that the outputs of causal discovery algorithms should not usually be considered end results but rather starting points and hypotheses for further study. The incentive to explore this topic came from a recent study by Krich et al. (2020), which gives a good introduction to estimating causal networks in biosphere–atmosphere interaction but confines the scope by investigating the outcome of a single algorithm. We aim to give a broader perspective to this study and to illustrate how not only different algorithms but also different initial states and prior information of possible causal model structures affect the outcome. We provide a proof-of-concept demonstration of how to incorporate expert domain knowledge with causal structure discovery and remark on how to detect and take into account over-fitting and concept drift.Peer reviewe

    Model selection with bootstrap validation

    No full text
    Model selection is one of the most central tasks in supervised learning. Validation set methods are the standard way to accomplish this task: models are trained on training data, and the model with the smallest loss on the validation data is selected. However, it is generally not obvious how much validation data is required to make a reliable selection, which is essential when labeled data are scarce or expensive. We propose a bootstrap-based algorithm, bootstrap validation (BSV), that uses the bootstrap to adjust the validation set size and to find the best-performing model within a tolerance parameter specified by the user. We find that BSV works well in practice and can be used as a drop-in replacement for validation set methods or k-fold cross-validation. The main advantage of BSV is that less validation data is typically needed, so more data can be used to train the model, resulting in better approximations and efficient use of validation data.Peer reviewe

    Sleep apnoea is a risk factor for severe COVID-19

    No full text
    Background Obstructive sleep apnoea (OSA) is associated with higher body mass index (BMI), diabetes, older age and male gender, which are all risk factors for severe COVID-19.We aimed to study if OSA is an independent risk factor for COVID-19 infection or for severe COVID-19.Methods OSA diagnosis and COVID-19 infection were extracted from the hospital discharge, causes of death and infectious diseases registries in individuals who participated in the FinnGen study (n=260 405). Severe COVID-19 was defined as COVID-19 requiring hospitalisation. Multivariate logistic regression model was used to examine association. Comorbidities for either COVID-19 or OSA were selected as covariates. We performed a meta-analysis with previous studies.Results We identified 445 individuals with COVID-19, and 38 (8.5%) of them with OSA of whom 19 out of 91 (20.9%) were hospitalised. OSA associated with COVID-19 hospitalisation independent from age, sex, BMI and comorbidities (p-unadjusted=5.13×10−5, OR-adjusted=2.93 (95% CI 1.02 to 8.39), p-adjusted=0.045). OSA was not associated with the risk of contracting COVID-19 (p=0.25). A meta-analysis of OSA and severe COVID-19 showed association across 15 835 COVID-19 positive controls, and n=1294 patients with OSA with severe COVID-19 (OR=2.37 (95% 1.14 to 4.95), p=0.021).Conclusion Risk for contracting COVID-19 was the same for patients with OSA and those without OSA. In contrast, among COVID-19 positive patients, OSA was associated with higher risk for hospitalisation. Our findings are in line with earlier works and suggest OSA as an independent risk factor for severe COVID-19

    Evidence of a causal effect of genetic tendency to gain muscle mass on uterine leiomyomata

    No full text
    Uterine leiomyomata (UL) are the most common tumours of the female genital tract and the primary cause of surgical removal of the uterus. Genetic factors contribute to UL susceptibility. To add understanding to the heritable genetic risk factors, we conduct a genome-wide association study (GWAS) of UL in up to 426,558 European women from FinnGen and a previous UL meta-GWAS. In addition to the 50 known UL loci, we identify 22 loci that have not been associated with UL in prior studies. UL-associated loci harbour genes enriched for development, growth, and cellular senescence. Of particular interest are the smooth muscle cell differentiation and proliferation-regulating genes functioning on the myocardin-cyclin dependent kinase inhibitor 1A pathway. Our results further suggest that genetic predisposition to increased fat-free mass may be causally related to higher UL risk, underscoring the involvement of altered muscle tissue biology in UL pathophysiology. Overall, our findings add to the understanding of the genetic pathways underlying UL, which may aid in developing novel therapeutics.Peer reviewe

    New insights into the genetic etiology of Alzheimer’s disease and related dementias

    Get PDF
    Characterization of the genetic landscape of Alzheimer’s disease (AD) and related dementias (ADD) provides a unique opportunity for a better understanding of the associated pathophysiological processes. We performed a two-stage genome-wide association study totaling 111,326 clinically diagnosed/‘proxy’ AD cases and 677,663 controls. We found 75 risk loci, of which 42 were new at the time of analysis. Pathway enrichment analyses confirmed the involvement of amyloid/tau pathways and highlighted microglia implication. Gene prioritization in the new loci identified 31 genes that were suggestive of new genetically associated processes, including the tumor necrosis factor alpha pathway through the linear ubiquitin chain assembly complex. We also built a new genetic risk score associated with the risk of future AD/dementia or progression from mild cognitive impairment to AD/dementia. The improvement in prediction led to a 1.6- to 1.9-fold increase in AD risk from the lowest to the highest decile, in addition to effects of age and the APOE ε4 allele

    New insights into the genetic etiology of Alzheimer’s disease and related dementias

    No full text
    Characterization of the genetic landscape of Alzheimer’s disease (AD) and related dementias (ADD) provides a unique opportunity for a better understanding of the associated pathophysiological processes. We performed a two-stage genome-wide association study totaling 111,326 clinically diagnosed/‘proxy’ AD cases and 677,663 controls. We found 75 risk loci, of which 42 were new at the time of analysis. Pathway enrichment analyses confirmed the involvement of amyloid/tau pathways and highlighted microglia implication. Gene prioritization in the new loci identified 31 genes that were suggestive of new genetically associated processes, including the tumor necrosis factor alpha pathway through the linear ubiquitin chain assembly complex. We also built a new genetic risk score associated with the risk of future AD/dementia or progression from mild cognitive impairment to AD/dementia. The improvement in prediction led to a 1.6- to 1.9-fold increase in AD risk from the lowest to the highest decile, in addition to effects of age and the APOE ε4 allele
    corecore