36 research outputs found
An environmental signature for 323 microbial genomes based on codon adaptation indices
BACKGROUND: Codon adaptation indices (CAIs) represent an evolutionary strategy to modulate gene expression and have widely been used to predict potentially highly expressed genes within microbial genomes. Here, we evaluate and compare two very different methods for estimating CAI values, one corresponding to translational codon usage bias and the second obtained mathematically by searching for the most dominant codon bias. RESULTS: The level of correlation between these two CAI methods is a simple and intuitive measure of the degree of translational bias in an organism, and from this we confirm that fast replicating bacteria are more likely to have a dominant translational codon usage bias than are slow replicating bacteria, and that this translational codon usage bias may be used for prediction of highly expressed genes. By analyzing more than 300 bacterial genomes, as well as five fungal genomes, we show that codon usage preference provides an environmental signature by which it is possible to group bacteria according to their lifestyle, for instance soil bacteria and soil symbionts, spore formers, enteric bacteria, aquatic bacteria, and intercellular and extracellular pathogens. CONCLUSION: The results and the approach described here may be used to acquire new knowledge regarding species lifestyle and to elucidate relationships between organisms that are far apart evolutionarily
Systematic Characterisation of Cellular Localisation and Expression Profiles of Proteins Containing MHC Ligands
Presentation of peptides on Major Histocompatibility Complex (MHC) molecules is the cornerstone in immune system activation and increased knowledge of the characteristics of MHC ligands and their source proteins is highly desirable.In the present large-scale study, we used a large data set of proteins containing experimentally identified MHC class I or II ligands and examined the proteins according to their expression profiles at the mRNA level and their Gene Ontology (GO) classification within the cellular component ontology. Proteins encoded by highly abundant mRNA were found to be much more likely to be the source of MHC ligands. Of the 2.5% most abundant mRNAs as much as 41% of the proteins encoded by these mRNAs contained MHC class I ligands. For proteins containing MHC class II ligands, the corresponding percentage was 11%. Furthermore, we found that most proteins containing MHC class I ligands were localised to the intracellular parts of the cell including the cytoplasm and nucleus. MHC class II ligand donors were, on the other hand, mostly membrane proteins.The results contribute to the ongoing debate concerning the nature of MHC ligand-containing proteins and can be used to extend the existing methods for MHC ligand predictions by including the source protein's localisation and expression profile. Improving the current methods is important in the growing quest for epitopes that can be used for vaccine or diagnostic purposes, especially when it comes to large DNA viruses and cancer
Sequence-based feature prediction and annotation of proteins
The combination of prediction tools in complex workflows and pipelines facilitates prediction of protein features from sequence
Evolution reveals a glutathione-dependent mechanism of 3-hydroxypropionic acid tolerance
Biologically produced 3-hydroxypropionic acid (3HP) is a potential source for sustainable acrylates and can also find direct use as monomer in the production of biodegradable polymers. For industrial scale production there is a need for robust cell factories tolerant to high concentration of 3HP, preferably at low pH. Through adaptive laboratory evolution we selected S. cerevisiae strains with improved tolerance to 3HP at pH 3.5. Genome sequencing followed by functional analysis identified the causal mutation in SFA1 gene encoding S-(hyclroxymerhyl)glutathione dehydrogenase. Based on our findings, we propose that 3HP toxicity is mediated by 3-hydroxypropionic aldehyde (reuterin ) and that glutathione-dependent reactions are used for reuterin detoxification. The identified molecular response to 3HP and reuterin may well be a general mechanism for handling resistance to organic acid and aldehydes by living cells. (C) 2014 International Metabolic Engineering Society Published by Elsevier Inc. On behalf of International Metabolic Engineering Society. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/3.0/
Improving comparability between microarray probe signals by thermodynamic intensity correction
Signals from different oligonucleotide probes against the same target show great variation in intensities. However, detection of differences along a sequence e.g. to reveal intron/exon architecture, transcription boundary as well as simple absent/present calls depends on comparisons between different probes. It is therefore of great interest to correct for the variation between probes. Much of this variation is sequence dependent. We demonstrate that a thermodynamic model for hybridization of either DNA or RNA to a DNA microarray, which takes the sequence-dependent probe affinities into account significantly reduces the signal fluctuation between probes targeting the same gene transcript. For a test set of tightly tiled yeast genes, the model reduces the variance by up to a factor ∼1/3. As a consequence of this reduction, the model is shown to yield a more accurate determination of transcription start sites for a subset of yeast genes. In another application, we identify present/absent calls for probes hybridized to the sequenced Escherichia coli strain O157:H7 EDL933. The model improves the correct calls from 85 to 95% relative to raw intensity measures. The model thus makes applications which depend on comparisons between probes aimed at different sections of the same target more reliable
Deciphering Diseases and Biological Targets for Environmental Chemicals using Toxicogenomics Networks
Exposure to environmental chemicals and drugs may have a negative effect on human health. A better understanding of the molecular mechanism of such compounds is needed to determine the risk. We present a high confidence human protein-protein association network built upon the integration of chemical toxicology and systems biology. This computational systems chemical biology model reveals uncharacterized connections between compounds and diseases, thus predicting which compounds may be risk factors for human health. Additionally, the network can be used to identify unexpected potential associations between chemicals and proteins. Examples are shown for chemicals associated with breast cancer, lung cancer and necrosis, and potential protein targets for di-ethylhexyl-phthalate, 2,3,7,8-tetrachlorodibenzo-p-dioxin, pirinixic acid and permethrine. The chemical-protein associations are supported through recent published studies, which illustrate the power of our approach that integrates toxicogenomics data with other data types
Minimal information for studies of extracellular vesicles (MISEV2023): From basic to advanced approaches
Extracellular vesicles (EVs), through their complex cargo, can reflect the state of their cell of origin and change the functions and phenotypes of other cells. These features indicate strong biomarker and therapeutic potential and have generated broad interest, as evidenced by the steady year-on-year increase in the numbers of scientific publications about EVs. Important advances have been made in EV metrology and in understanding and applying EV biology. However, hurdles remain to realising the potential of EVs in domains ranging from basic biology to clinical applications due to challenges in EV nomenclature, separation from non-vesicular extracellular particles, characterisation and functional studies. To address the challenges and opportunities in this rapidly evolving field, the International Society for Extracellular Vesicles (ISEV) updates its 'Minimal Information for Studies of Extracellular Vesicles', which was first published in 2014 and then in 2018 as MISEV2014 and MISEV2018, respectively. The goal of the current document, MISEV2023, is to provide researchers with an updated snapshot of available approaches and their advantages and limitations for production, separation and characterisation of EVs from multiple sources, including cell culture, body fluids and solid tissues. In addition to presenting the latest state of the art in basic principles of EV research, this document also covers advanced techniques and approaches that are currently expanding the boundaries of the field. MISEV2023 also includes new sections on EV release and uptake and a brief discussion of in vivo approaches to study EVs. Compiling feedback from ISEV expert task forces and more than 1000 researchers, this document conveys the current state of EV research to facilitate robust scientific discoveries and move the field forward even more rapidly