15 research outputs found

    Comparative performances of machine learning methods for classifying Crohn Disease patients using genome-wide genotyping data

    Get PDF
    Abstract: Crohn Disease (CD) is a complex genetic disorder for which more than 140 genes have been identified using genome wide association studies (GWAS). However, the genetic architecture of the trait remains largely unknown. The recent development of machine learning (ML) approaches incited us to apply them to classify healthy and diseased people according to their genomic information. The Immunochip dataset containing 18,227 CD patients and 34,050 healthy controls enrolled and genotyped by the international Inflammatory Bowel Disease genetic consortium (IIBDGC) has been re-analyzed using a set of ML methods: penalized logistic regression (LR), gradient boosted trees (GBT) and artificial neural networks (NN). The main score used to compare the methods was the Area Under the ROC Curve (AUC) statistics. The impact of quality control (QC), imputing and coding methods on LR results showed that QC methods and imputation of missing genotypes may artificially increase the scores. At the opposite, neither the patient/control ratio nor marker preselection or coding strategies significantly affected the results. LR methods, including Lasso, Ridge and ElasticNet provided similar results with a maximum AUC of 0.80. GBT methods like XGBoost, LightGBM and CatBoost, together with dense NN with one or more hidden layers, provided similar AUC values, suggesting limited epistatic effects in the genetic architecture of the trait. ML methods detected near all the genetic variants previously identified by GWAS among the best predictors plus additional predictors with lower effects. The robustness and complementarity of the different methods are also studied. Compared to LR, non-linear models such as GBT or NN may provide robust complementary approaches to identify and classify genetic markers

    An Overview of Occupational Risks From Climate Change

    No full text
    Changes in atmosphere and temperature are affecting multiple environmental indicators from extreme heat events to global air quality. Workers will be uniquely affected by climate change, and the occupational impacts of major shifts in atmospheric and weather conditions need greater attention. Climate change-related exposures most likely to differentially affect workers in the USA and globally include heat, ozone, polycyclic aromatic hydrocarbons, other chemicals, pathogenic microorganisms, vector-borne diseases, violence, and wildfires. Epidemiologic evidence documents a U-, J-, or V-shaped relationship between temperature and mortality. Whereas heat-related morbidity and mortality risks are most evident in agriculture, many other outdoor occupational sectors are also at risk, including construction, transportation, landscaping, firefighting, and other emergency response operations. The toxicity of chemicals change under hyperthermic conditions, particularly for pesticides and ozone. Combined with climate-related changes in chemical transport and distribution, these interactions represent unique health risks specifically to workers. Links between heat and interpersonal conflict including violence require attention because they pose threats to the safety of emergency medicine, peacekeeping and humanitarian relief, and public safety professionals. Recommendations for anticipating how US workers will be most susceptible to climate change include formal monitoring systems for agricultural workers; modeling scenarios focusing on occupational impacts of extreme climate events including floods, wildfires, and chemical spills; and national research agenda setting focusing on control and mitigation of occupational susceptibility to climate change
    corecore