75 research outputs found

    Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data

    Get PDF
    Abstract: Crohn Disease (CD) is a complex genetic disorder for which more than 140 genes have been identified using genome wide association studies (GWAS). However, the genetic architecture of the trait remains largely unknown. The recent development of machine learning (ML) approaches incited us to apply them to classify healthy and diseased people according to their genomic information. The Immunochip dataset containing 18,227 CD patients and 34,050 healthy controls enrolled and genotyped by the international Inflammatory Bowel Disease genetic consortium (IIBDGC) has been re-analyzed using a set of ML methods: penalized logistic regression (LR), gradient boosted trees (GBT) and artificial neural networks (NN). The main score used to compare the methods was the Area Under the ROC Curve (AUC) statistics. The impact of quality control (QC), imputing and coding methods on LR results showed that QC methods and imputation of missing genotypes may artificially increase the scores. At the opposite, neither the patient/control ratio nor marker preselection or coding strategies significantly affected the results. LR methods, including Lasso, Ridge and ElasticNet provided similar results with a maximum AUC of 0.80. GBT methods like XGBoost, LightGBM and CatBoost, together with dense NN with one or more hidden layers, provided similar AUC values, suggesting limited epistatic effects in the genetic architecture of the trait. ML methods detected near all the genetic variants previously identified by GWAS among the best predictors plus additional predictors with lower effects. The robustness and complementarity of the different methods are also studied. Compared to LR, non-linear models such as GBT or NN may provide robust complementary approaches to identify and classify genetic markers

    Vascular and blood-brain barrier-related changes underlie stress responses and resilience in female mice and depression in human tissue

    Get PDF
    Prevalence, symptoms, and treatment of depression suggest that major depressive disorders (MDD) present sex differences. Social stress-induced neurovascular pathology is associated with depressive symptoms in male mice; however, this association is unclear in females. Here, we report that chronic social and subchronic variable stress promotes blood-brain barrier (BBB) alterations in mood-related brain regions of female mice. Targeted disruption of the BBB in the female prefrontal cortex (PFC) induces anxiety- and depression-like behaviours. By comparing the endothelium cell-specific transcriptomic profiling of the mouse male and female PFC, we identify several pathways and genes involved in maladaptive stress responses and resilience to stress. Furthermore, we confirm that the BBB in the PFC of stressed female mice is leaky. Then, we identify circulating vascular biomarkers of chronic stress, such as soluble E-selectin. Similar changes in circulating soluble E-selectin, BBB gene expression and morphology can be found in blood serum and postmortem brain samples from women diagnosed with MDD. Altogether, we propose that BBB dysfunction plays an important role in modulating stress responses in female mice and possibly MDD

    Improved Approximation Algorithms for Budgeted Allocations

    No full text
    Abstract. We provide a 3/2-approximation algorithm for an offline budgeted allocations problem, an improvement over the e/(e − 1) approximation of Andelman and Mansour [1] and the e/(e − 1) − ɛ approximation (for ɛ ≈ 0.0001) of Feige and Vondrak [5] for the more general Maximum Submodular Welfare (SMW) problem. For a special case of our problem, we improve this ratio to √ 2. Finally, we prove that it is APX-hard. The problem we study has applications to sponsored search auctions.

    On Revenue Maximization in Second-price Ad Auctions

    No full text
    Most recent papers addressing the algorithmic problem of allocating advertisement space for keywords in sponsored search auctions assume that pricing is done via a first-price auction, which does not realistically model the Generalized Second Price (GSP) auction used in practice. Towards the goal of more realistically modeling these auctions, we introduce the Second-Price Ad Auctions problem, in which bidders’ payments are determined by the GSP mechanism. We show that the complexity of the Second-Price Ad Auctions problem is quite different than that of the more studied First-Price Ad Auctions problem. First, unlike the first-price variant, for which small constant-factor approximations are known, it is NP-hard to approximate the Second-Price Ad Auctions problem to any non-trivial factor. Second, this discrepancy extends even to the 0-1 special case that we call the Second-Price Matching problem (2PM). In particular, offline 2PM is APX-hard, and for online 2PM there is no deterministic algorithm achieving a non-trivial competitive ratio and no randomized algorithm achieving a competitive ratio better than 2. This stands in contrast to the results for the analogous special case in the first-price model, the standard bipartite matching problem, which is solvable in polynomial time and which has deterministic and randomized online algorithms achieving better competitive ratios. On the positive side, we provide a 2-approximation for offline 2PM and a 5.083-competitive randomized algorithm for online 2PM. The latter result makes use of a new generalization of a classic result on the performance of the “Ranking” algorithm for online bipartite matching

    Multivariable prediction model for suspected giant cell arteritis: development and validation

    No full text
    Edsel B Ing,1 Gabriela Lahaie Luna,2 Andrew Toren,3 Royce Ing,4 John J Chen,5 Nitika Arora,6 Nurhan Torun,7 Otana A Jakpor,8 J Alexander Fraser,9 Felix J Tyndel,10 Arun NE Sundaram,10 Xinyang Liu,11 Cindy TY Lam,1 Vivek Patel,12 Ezekiel Weis,13 David Jordan,14 Steven Gilberg,14 Christian Pagnoux,15 Martin ten Hove21Department of Ophthalmology and Vision Sciences, University of Toronto Medical School, Toronto, 2Department of Ophthalmology, Queen’s University, Kingston, ON, 3Department of Ophthalmology, University of Laval, Quebec, QC, 4Toronto Eyelid, Strabismus and Orbit Surgery Clinic, Toronto, ON, Canada; 5Mayo Clinic, Department of Ophthalmology and Neurology, 6Mayo Clinic, Department of Ophthalmology, Rochester, MN, 7Department of Surgery, Division of Ophthalmology, Harvard Medical School, Boston, MA, 8Harvard Medical School, Boston, MA, USA; 9Department of Clinical Neurological Sciences and Ophthalmology, Western University, London, 10Department of Medicine, University of Toronto Medical School, Toronto, ON, Canada; 11Department of Medicine, Fudan University Shanghai Medical College, Shanghai, People’s Republic of China; 12Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA; 13Departments of Ophthalmology, Universities of Alberta and Calgary, Edmonton and Calgary, AB, 14Department of Ophthalmology, University of Ottawa, Ottawa, ON, 15Vasculitis Clinic, Mount Sinai Hospital, Toronto, ON, CanadaPurpose: To develop and validate a diagnostic prediction model for patients with suspected giant cell arteritis (GCA).Methods: A retrospective review of records of consecutive adult patients undergoing temporal artery biopsy (TABx) for suspected GCA was conducted at seven university centers. The pathologic diagnosis was considered the final diagnosis. The predictor variables were age, gender, new onset headache, clinical temporal artery abnormality, jaw claudication, ischemic vision loss (VL), diplopia, erythrocyte sedimentation rate (ESR), C-reactive protein (CRP), and platelet level. Multiple imputation was performed for missing data. Logistic regression was used to compare our models with the non-histologic American College of Rheumatology (ACR) GCA classification criteria. Internal validation was performed with 10-fold cross validation and bootstrap techniques. External validation was performed by geographic site.Results: There were 530 complete TABx records: 397 were negative and 133 positive for GCA. Age, jaw claudication, VL, platelets, and log CRP were statistically significant predictors of positive TABx, whereas ESR, gender, headache, and temporal artery abnormality were not. The parsimonious model had a cross-validated bootstrap area under the receiver operating characteristic curve (AUROC) of 0.810 (95% CI =0.766–0.854), geographic external validation AUROC’s in the range of 0.75–0.85, calibration pH–L of 0.812, sensitivity of 43.6%, and specificity of 95.2%, which outperformed the ACR criteria.Conclusion: Our prediction rule with calculator and nomogram aids in the triage of patients with suspected GCA and may decrease the need for TABx in select low-score at-risk subjects. However, misclassification remains a concern.Keywords: temporal artery biopsy, diagnosis, prediction rule, nomogram, giant cell arteritis, validatio
    corecore