412 research outputs found
Comparing Machine Learning and Logistic Regression Methods for Predicting Hypertension Using a Combination of Gene Expression and Next-Generation Sequencing Data
Machine learning methods continue to show promise in the analysis of data from genetic association studies because of the high number of variables relative to the number of observations. However, few best practices exist for the application of these methods. We extend a recently proposed supervised machine learning approach for predicting disease risk by genotypes to be able to incorporate gene expression data and rare variants. We then apply 2 different versions of the approach (radial and linear support vector machines) to simulated data from Genetic Analysis Workshop 19 and compare performance to logistic regression. Method performance was not radically different across the 3 methods, although the linear support vector machine tended to show small gains in predictive ability relative to a radial support vector machine and logistic regression. Importantly, as the number of genes in the models was increased, even when those genes contained causal rare variants, model predictive ability showed a statistically significant decrease in performance for both the radial support vector machine and logistic regression. The linear support vector machine showed more robust performance to the inclusion of additional genes. Further work is needed to evaluate machine learning approaches on larger samples and to evaluate the relative improvement in model prediction from the incorporation of gene expression data
Recommended from our members
Written submission from the School of Law, Politics and Sociology, University of Sussex (OEU0007)to the Women and equalities Committee inquiry: ensuring strong equalities legislation after EU exit
Undoubtedly, the UKβs equalities legislation has become stronger in recent years providing important protection for people who experience discrimination. Nevertheless, this has happened in the context of widening economic inequality, cuts in public services and restrictions on access to justice β all of which make it harder for victims of discrimination to realise the rights that exist on paper. If the UK leaves the EU, the next few years will be a period of great political, economic and social instability when it will be vital to ensure that protection against discrimination is strengthened not weakened and that a culture of support for equality and human rights is promoted throughout the UK. The rights that must be protected benefit everyone in the UK, not only supporting marginalised people and victims of discrimination but also making workplaces fairer for all and underpinning the legitimacy of our democratic institutions
Toward an Integrated Model of Capsule Regulation in Cryptococcus neoformans
Cryptococcus neoformans is an opportunistic fungal pathogen that causes serious human disease in immunocompromised populations. Its polysaccharide capsule is a key virulence factor which is regulated in response to growth conditions, becoming enlarged in the context of infection. We used microarray analysis of cells stimulated to form capsule over a range of growth conditions to identify a transcriptional signature associated with capsule enlargement. The signature contains 880 genes, is enriched for genes encoding known capsule regulators, and includes many uncharacterized sequences. One uncharacterized sequence encodes a novel regulator of capsule and of fungal virulence. This factor is a homolog of the yeast protein Ada2, a member of the Spt-Ada-Gcn5 Acetyltransferase (SAGA) complex that regulates transcription of stress response genes via histone acetylation. Consistent with this homology, the C. neoformans null mutant exhibits reduced histone H3 lysine 9 acetylation. It is also defective in response to a variety of stress conditions, demonstrating phenotypes that overlap with, but are not identical to, those of other fungi with altered SAGA complexes. The mutant also exhibits significant defects in sexual development and virulence. To establish the role of Ada2 in the broader network of capsule regulation we performed RNA-Seq on strains lacking either Ada2 or one of two other capsule regulators: Cir1 and Nrg1. Analysis of the results suggested that Ada2 functions downstream of both Cir1 and Nrg1 via components of the high osmolarity glycerol (HOG) pathway. To identify direct targets of Ada2, we performed ChIP-Seq analysis of histone acetylation in the Ada2 null mutant. These studies supported the role of Ada2 in the direct regulation of capsule and mating responses and suggested that it may also play a direct role in regulating capsule-independent antiphagocytic virulence factors. These results validate our experimental approach to dissecting capsule regulation and provide multiple targets for future investigation
In situ formation of suspended graphene windows for lab-based XPS in liquid and gas environments
Environmental cells sealed with photoelectron-transparent graphene windows are promising for extending X-ray photoelectron spectroscopy (XPS) to liquid and high-pressure gas environments for in situ and operando studies. However, the reliable production of graphene windows that are sufficiently leak-tight for extended measurements remains a challenge. Here we demonstrate a PDMS/Au(100 nm)-supported transfer method that reliably produces suspended graphene on perforated silicon nitride membranes without significant contamination. A yield of ~95% is achieved based on single-layer graphene covering >98% of the holes in the silicon nitride membrane. Even higher coverages are achieved for stacked bilayer graphene, allowing wet etching (aqueous KI/I2) of the Au support to be observed in a conventional lab-based XPS system, thereby demonstrating the in situ formation of leak-tight, suspended graphene windows. Furthermore, these windows allow gas-phase measurements at close to atmospheric pressure, showing future promise for XPS under higher-pressure gas environments in conventional lab-based systems
High glucose disrupts oligosaccharide recognition function via competitive inhibition : a potential mechanism for immune dysregulation in diabetes mellitus
Diabetic complications include infection and cardiovascular disease. Within the immune system, host-pathogen and regulatory host-host interactions operate through binding of oligosaccharides by C-type lectin. A number of C-type lectins recognise oligosaccharides rich in mannose and fucose β sugars with similar structures to glucose. This raises the possibility that high glucose conditions in diabetes affect protein-oligosaccharide interactions via competitive inhibition. Mannose binding lectin, soluble DC-SIGN & DC-SIGNR, and surfactant protein D, were tested for carbohydrate binding in the presence of glucose concentrations typical of diabetes, via surface plasmon resonance and affinity chromatography. Complement activation assays were performed in high glucose. DC-SIGN and DC-SIGNR expression in adipose tissues was examined via immunohistochemistry. High glucose inhibited C-type lectin binding to high-mannose glycoprotein and binding of DC-SIGN to fucosylated ligand (blood group B) was abrogated in high glucose. Complement activation via the lectin pathway was inhibited in high glucose and also in high trehalose - a nonreducing sugar with glucoside stereochemistry. DC-SIGN staining was seen on cells with DC morphology within omental and subcutaneous adipose tissues. We conclude that high glucose disrupts C-type lectin function, potentially illuminating new perspectives on susceptibility to infectious and inflammatory disease in diabetes. Mechanisms involve competitive inhibition of carbohydrate-binding within sets of defined proteins, in contrast to broadly indiscriminate, irreversible glycation of proteins
Revealing the role of CO during CO2 hydrogenation on Cu surfaces with in situ soft X-ray spectroscopy
The reactions of H2, CO2, and CO gas mixtures on the surface of Cu at 200 Β°C, relevant for industrial methanol synthesis, are investigated using a combination of ambient pressure X-ray photoelectron spectroscopy (AP-XPS) and atmospheric-pressure near edge X-ray absorption fine structure (AtmP-NEXAFS) spectroscopy bridging pressures from 0.1 mbar to 1 bar. We find that the order of gas dosing can critically affect the catalyst chemical state, with the Cu catalyst maintained in a metallic state when H2 is introduced prior to the addition of CO2. Only on increasing the CO2 partial pressure is CuO formation observed that coexists with metallic Cu. When only CO2 is present, the surface oxidizes to Cu2O and CuO, and the subsequent addition of H2 partially reduces the surface to Cu2O without recovering metallic Cu, consistent with a high kinetic barrier to H2 dissociation on Cu2O. The addition of CO to the gas mixture is found to play a key role in removing adsorbed oxygen that otherwise passivates the Cu surface, making metallic Cu surface sites available for CO2 activation and subsequent conversion to CH3OH. These findings are corroborated by mass spectrometry measurements, which show increased H2O formation when H2 is dosed before rather than after CO2. The importance of maintaining metallic Cu sites during the methanol synthesis reaction is thereby highlighted, with the inclusion of CO in the gas feed helping to achieve this even in the absence of ZnO as the catalyst support
Recommended from our members
Machine learning and data mining in complex genomic data a review on the lessons learned in Genetic Analysis Workshop Nineteen
In the analysis of current genomic data, application of machine learning and data mining techniques has become more attractive given the rising complexity of the projects. As part of the Genetic Analysis Workshop 19, approaches from this domain were explored, mostly motivated from two starting points. First, assuming an underlying structure in the genomic data, data mining might identify this and thus improve downstream association analyses. Second, computational methods for machine learning need to be developed further to efficiently deal with the current wealth of data.
In the course of discussing results and experiences from the machine learning and data mining approaches, six common messages were extracted. These depict the current state of these approaches in the application to complex genomic data. Although some challenges remain for future studies, important forward steps were taken in the integration of different data types and the evaluation of the evidence. Mining the data for underlying genetic or phenotypic structure and using this information in subsequent analyses proved to be extremely helpful and is likely to become of even greater use with more complex data sets
- β¦